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Preface to the 2nd Edition 


Particle physics and cosmology enjoyed rapid development between the first and 
the second editions of this book. Experimental research resulted in a wealth of 
new data, which forced us to correct and in some places rewrite chapters on dark 
matter, phase transitions and baryon asymmetry of the Universe. We have also made 
substantial revisions in other chapters. The numerical values of particle physics and 
cosmological parameters have been updated in accordance to contemporary data. 

Besides that, we have corrected numerous misprints and drawbacks that existed 
in the first edition. We are indebted to our numerous colleagues for their input, and 
also students at the Department of Particle Physics and Cosmology, which has been 
recently created at the Physics Faculty of the Lomonosov Moscow State University 
(http://ppe.inr.ac.ru). 
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Preface to the 1st Edition 


It is clear by now that there is a deep interconnection between cosmology and par- 
ticle physics, and between macro- and micro-worlds. This book is written precisely 
from this perspective. We present here the results on the homogeneous and isotropic 
Universe at the hot stage of its evolution and at subsequent stages. This part of cos- 
mology is often dubbed as the Hot Big Bang theory. In the accompanying book we 
study the theory of cosmological perturbations (inhomogeneities in the Universe), 
inflationary theory and the theory of post-inflationary reheating. 

This book grew from the lecture course which had been taught for a number 
of years at the Department of Quantum Statistics and Field Theory of the Physics 
Faculty of the Lomonosov Moscow State University. This course is aimed at under- 
graduate students specializing in theoretical physics. We decided, however, to add a 
number of more advanced Chapters and Sections which we mark by asterisks. The 
reason is that there are problems in cosmology (nature of dark matter and dark 
energy, mechanism of the matter-antimatter asymmetry generation, etc.) which 
have not found their compelling solutions yet. Most of the additional material deals 
with hypotheses on these problems that at the moment compete with each other. 

Knowledge of material taught in general physics courses is in principle sufficient 
for reading the main Chapters of this book. So, the main Chapters must be under- 
standable by undergraduate students. The necessary material on General Relativity 
and particle physics is collected in the Appendices which, of course, do not pretend 
to give comprehensive account of these areas of physics. On the other hand, some 
parts labeled by asterisks make use of the methods of classical and quantum field 
theory as well as nonequilibrium statistical mechanics, so basic knowledge of these 
methods is required for reading these parts. 

Literature on cosmology is huge, and presenting systematic and comprehensive 
bibliography would be way out of the scope of this book. To orient the reader, at 
the end of this book we give a list of monographs and reviews where the issues we 
touch upon are considered in detail. Certainly, this list is by no means complete. We 
occasionally refer to original literature, especially in those places where we present 
concrete results without detailed derivation. 


x Preface 


Both observational cosmology and experimental particle physics develop very 
fast. Observational and experimental data we quote, the results of their compilations 
and fits (values of the cosmological parameters, limits on masses and couplings of 
hypothetical particles, etc.) will most probably get more precise even before this 
book is published. This drawback can be corrected, e.g., by using the regularly 
updated material of Particle Data Group at http://pdg.lbl.gov/. 

We would like to thank our colleagues from the Institute for Nuclear Research of 
the Russian Academy of Sciences — F. L. Bezrukov, S. V. Demidov, V. A. Kuzmin, 
D. G. Levkov, M. V. Libanov, E. Y. Nugaev, G. I. Rubtsov, D. V. Semikoz, 
P. G. Tinyakov, I. I. Tkachev and S. V. Troitsky for participation in the prepa- 
ration of the lecture course and numerous helpful discussions and comments. Our 
special thanks are to S. L. Dubovsky who participated in writing this book at an 
early stage. We are deeply indebted to V. S. Berezinsky, A. Boyarsky, A. D. Dolgov, 
D. I. Kazakov, S. Y. Khlebnikov, V. F. Mukhanov, I. D. Novikov, K. A. Postnov, 
M. V. Sazhin, M. E. Shaposhnikov, A. Y. Smirnov, A. A. Starobinsky, R. A. Sun- 
yaev, A. N. Tavkhelidze, O. V. Verkhodanov, A. Vilenkin, M. B. Voloshin and 
M. I. Vysotsky for many useful comments and criticism. 
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Chapter 1 


Cosmology: A Preview 


The purpose of this Chapter is to give a preview of the field which we consider in 
this and the accompanying book. The presentation here is at the qualitative level, 
and is by no means complete. Our purpose is to show the place of one or another 
topic within the entire area of cosmology. 

Before proceeding, let us introduce units and conventions that we use throughout 
this book. 


1.1. Units 


We mostly use the “natural” system of units in which the Planck constant, speed 
of light and the Boltzmann constant are equal to 1, 


h=c=k,= 1. 


Then the mass M, energy E and temperature T have the same dimension (since 
[E] = [me], [E] = [ksT]). A convenient unit of mass and energy is 1eV or 1 GeV = 
10° eV; the proton mass is then equal to mp = 0.938 GeV, and 1K is approximately 
10-13 GeV. Time ¢ and length / in the natural system have dimension M~! (since 
[E] = [hw], [v] = [t71] and [2] = [ct]), with 1GeV~! ~ 10-'4cm and 1GeV7! ~ 
107?4s. We give the coefficients relating various units in Tables 1.1 and 1.2. 

Problem 1.1. Check the relations given in Tables 1.1 and 1.2. What are 1 Volt (V), 


o 


1 Gauss (G), 1 Hertz (Hz) and 1 Angström (A) in natural system of units? 


In natural system of units, the Newton gravity constant G has dimension M~?. 
This follows from the formula for the gravitational potential energy V = -G™72, 
since [V] = M, [r~'] = M. It is convenient to introduce the Planck mass Mp, in 
the following way, 


G = M7’. 
Numerically 


Mp, = 1.2- 1079 GeV, (1.1) 
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Table 1.1. Conversion of natural units into CGS units. 


Energy 1 GeV = 1.6 -10~% erg 

Mass 1GeV = 1.8 - 107?4 g 
Temperature 1 GeV = 1.2- 1013 K 

Length 1 GeV! = 2.0 - 10714 cm 
Time 1 GeV} = 6.6 107? s 
Particle number density 1 GeV? = 1.3- 1044 cm~? 
Energy density 1 GeV4 = 2.1 - 1038 erg - cm73 
Mass density 1GeV4 = 2.3 - 1017 g -cm7 


Table 1.2. Conversion of CGS units into natural units. 


Energy 1 erg = 6.2 - 10? GeV 

Mass 1g = 5.6 - 1023 GeV 
Temperature 1K = 8.6 - 10714 GeV 

Length 1cm = 5.1 - 1013 GeV-! 

Time 1s = 1.5 - 10?4 GeV-! 

Particle number density 1 cm7? = 7.7 - 1074? GeV3 
Energy density lerg- cm7? = 4.8 - 10799 GeV4 
Mass density 1g: cm7? = 4.3 - 10718 GeV4 


and the Planck length, time and mass are 


1 


lpi = = 1.6 - 107°’ cm, 
1 —44 

ter = qy 554 107s, (1.2) 
PI 


Mp = 2.2 -107% g. 
The gravitational interactions are weak precisely because Mp, is large. 
Problem 1.2. Check the relations (1.1) and (1.2). 


Problem 1.3. What is the ratio of gravitational interaction energy to Coulomb 
energy for two protons? 


The traditional unit of length in cosmology is megaparsec, 
1Mpc = 3.1 - 1074 cm. 


Let us also introduce a convention which we use in this book. The subscript 0 
denotes present values of quantities which can depend on time. As an example, p(t) 
denotes the energy density in the Universe as a function of time, while po = p(to) 
is always its present value. 


There are several units of length that are used in astronomy, depending on sizes of objects 
and length scales considered. Besides the metric system, in use are astronomical unit (a.u.), 
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which is the average distance from the Earth to the Sun, 
lau. = 1.5 - 10” cm; 
light year, the distance that a photon travels in one year, 


lyear = 3.16- 107s, light year = 3 - 19° -3.16 - 107 s = 0.95 - 10'8 cm; 


and parsec (pc) — distance from which an object of size 1 a.u. is seen at angle 1 arc second, 


1pc = 2.1 - 10° a.u. = 3.3 light year = 3.1 - 10’? cm. 


To illustrate the hierarchy of spatial scales in the Universe, let us give the distances to 
various objects expressed in the above units. 

10a.u. is the average distance to Saturn, 30a.u. is the same for Pluto, 100 a.u. is the 
estimate of maximum distance which can be reached by solar wind (particles emitted by 
the Sun). 100 a.u. is also the estimate of the maximum distance to cosmic probes (Pioneer 
10, Voyager 1, Voyager 2). Further out is the Oort cloud, the source of the most distant 
comets, which is at the distance of 104—10° a.u. ~0.1-1 pc. 

The nearest stars — Proxima and Alpha Centauri — are at 1.3pc from the Sun. 
The distance to Arcturus and Capella is more than 10 pc, the distances to Canopus and 
Betelgeuse are about 100pc and 200 pc, respectively; Crab Nebula — the remnant of 
supernova seen by naked eye — is 2kpc away from us. 

The next point on the scale of distances is 8kpc. This is the distance from the Sun to 
the center of our Galaxy. Our Galaxy is of spiral type, the diameter of its disc is about 
30 kpc and the thickness of the disc is about 250 pc. The distance to the nearest dwarf 
galaxies, satellites of our Galaxy, is about 30 kpc. Fourteen of these satellites are known; 
the largest of them — Large and Small Magellanic Clouds — are 50 kpc away. Search for 
new, dimmer satellite dwarfs is underway; we note in this regard that only eight of Milky 
Way satellites were known by 1994. 

The mass density of the usual matter in usual (not dwarf) galaxies is about 10° higher 
than the average over the Universe. 

The nearest usual galaxy—the spiral galaxy M31 in Andromeda constellation — is 
800 kpc away from the Milky Way. Despite the large distance, it occupies a sizeable area 
on the celestial sphere: its angular size is larger than that of the Moon! Another nearby 
galaxy is in the Triangulum constellation. Our Galaxy together with the Andromeda and 
Triangulum galaxies, their satellites and other 35 smaller galaxies constitute the Local 
Group, the gravitationally bound object consisting of more than 50 galaxies. 

The next scale in this ladder is the size of clusters of galaxies, which is 1-3 Mpc. 
Rich clusters contain thousands of galaxies. The mass density in clusters exceeds the 
average density over the Universe by a factor of a hundred and even sometimes a thousand. 
The distance to the center of the nearest cluster, which is in the Virgo constellation, is 
about 15 Mpc. Its angular size is about 5 degrees. Clusters of galaxies are the largest 
gravitationally bound systems in the Universe. 
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We begin our preview with the brief discussion of the properties of the present 
Universe (more precisely, of its observable part). 
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1.2.1. Homogeneity and isotropy 


The Universe is homogeneous and isotropic at large spatial scales. The sizes of the 
largest structures in the Universe — superclusters of galaxies and gigantic voids — 
reach! tens of megaparsec. At scales exceeding 200 Mpc, all parts of the Universe 
look the same (homogeneity). Likewise, there are no special directions in the Uni- 
verse (isotropy). These facts are well established by deep galaxy surveys which 
collected data on more than a million galaxies. 


About 20 superclusters are known by now. The Local Group belongs to a supercluster with 
the center in the direction of Virgo constellation. The size of this supercluster is about 
30 Mpc, and besides the Virgo cluster and Local Group it contains about a hundred groups 
and clusters of galaxies. Superclusters are rather loose: the density of galaxies in them is 
only twice higher than the average in the Universe. The nearest to Virgo is the supercluster 
in the Hydra and Centaurus constellations; its distance to the Virgo supercluster is about 
half a hundred megaparsec. 

The largest catalog of galaxies and quasars up to date is the freely available catalog 
of SDSS [2] (Sloan Digital Sky Survey). This catalog is the result of the analysis of the 
data collected during almost 8 years of operation of a dedicated telescope, 2.5 meters in 
diameter, which is capable of measuring simultaneously spectra of 640 astrophysical objects 
in 5 optical bandpasses (photon wavelengths A = 3800—9200 A). The catalog includes 
millions celestial objects. Most of the data has been processed by now; measurements of 
spectra of more than 1.8 million galaxies and more than 300 thousand quasars resulted 
in the creation of a 3-dimensional map covering a large part of the visible Universe. Its 
area exceeds a quarter of the sky. There are other catalogs which cover smaller parts of 
the Universe (see, e.g., Ref. [3] for the next-to-largest catalog based on the 2dF Galaxy 
Redshift Survey). 

The early SDSS results are illustrated in Fig. 13.1 in color pages, where positions 
of 40 thousand galaxies and 4 thousand quasars are presented. The covered part of the 
celestial sphere has the area of 500 squared degrees. Recognizable are clusters of galaxies 
and voids. Isotropy and homogeneity of the Universe are restored at spatial scales of order 
100 Mpc and larger. Color of each dot refers to the type of the astrophysical object. 
The domination of one type over others is, generally speaking, caused by peculiarities of 
structure formation and evolution. Thus, what one observes is partially pictured in time 
rather than in space. 

Indeed, from the distance of 1.5 Gpc, where the distribution of bright red elliptical galaxies 
(red dots in Fig. 13.1) is at maximum, light travels to the Earth for about 5 billion years. At 
that epoch, the Universe was different (for instance, there was no Solar system yet). One 
more reason for choosing objects of a certain type is the finite sensitivity of a telescope. In 
particular, only highly luminous objects can be detected at the largest distances, while the 
highest-luminosity, continuously shining objects in the Universe are quasars. 


1.2.2. Expansion 


The Universe expands: the distances between galaxies increase.” Loosely speaking, 
the space, being always homogeneous and isotropic, stretches out. 


1This is a somewhat loose statement: most accurate estimates are obtained from the galaxy cor- 
relation function, which falls off as power-law at large separations. 
2Of course, this does not apply to galaxies that are gravitationally bound to each other in clusters. 
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To describe this expansion, one introduces the scale factor a(t) which grows in 
time. The distance between two far away objects in the Universe is proportional to 
a(t) and the number density of particles decreases as a~3(t). The rate of the cosmo- 
logical expansion, i.e., the relative growth of distances in unit time, is characterized 
by the Hubble parameter, 


Hereafter, the dot denotes the derivative with respect to the cosmic time t. The 
Hubble parameter depends on time; its present value, according to our convention, 
is denoted by Ho. 

The expansion of the Universe gives rise also to the growth of the wavelength 
of a photon emitted in distant past. Like other distances, the photon wavelength 
increases proportionally to a(t); the photon experiences redshift. This redshift z is 
determined by the ratio of photon wavelengths at absorption and emission, 


Aab 
Aem 


=1+z. (1.3) 


Clearly, this ratio depends on the moment of the emission (assuming that the photon 
is detected today on the Earth), i.e., on the distance to the source. Redshift is 
a directly measurable quantity: the wavelength at emission is determined by the 
physics of the emission process (e.g., by energies of an excited and the ground state 
of an atom), while Aap is the measured wavelength. Thus, one identifies the system 
of emission (or absorption) lines and determines how much they are shifted to the 
red spectral region, and in this way one measures the redshift. 

In reality, the identification of lines makes use of patterns which are characteristic 
of particular objects, see Fig. 1.1, Ref. [5]. If the spectrum contains absorption dips, 
as in Fig. 1.1, then the object whose redshift is being measured is between the 
emitter and observer. The peaks in the spectrum — emission lines — mean that 
the object is an emitter itself. 

For z < 1, the distance to the source r and the redshift are related by the 
Hubble law 


z= Hor, z<1. (1.4) 


At larger z the redshift-distance relation is more complicated, which we will discuss 
in detail in this book. 

The determination of absolute distances to far away sources is a complicated 
problem. One of the methods is to measure the photon flux from a source whose 
absolute luminosity is assumed to be known. These sources are sometimes called 
standard candles. 


3Photons of definite wavelengths experience resonant absorption by atoms and ions, with subse- 
quent isotropic emission. This leads to the loss of photons reaching the observer. 
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Fig. 1.1. Absorption lines of distant galaxies [5]. The upper panel shows the measurement of the 
differential energy flux from a far away galaxy (z = 2.0841). The vertical lines show the position of 
atomic lines whose identification has been used to measure redshift. The spectra of nearer galaxies 
have more pronounced dips. The plot with the spectra of these galaxies, shifted to comoving frame, 
is shown in the lower panel. 


Systematic uncertainties in the determination of Ho were not particularly well 
known until recently and they are still fairly large. We note in this regard that the 
value of the Hubble constant as determined by Hubble in 1929 was 550 km/ (s - Mpc). 
The contemporary determinations give [1] 


km 
s- Mpc 


Ho = (67.3 + 1.2) (1.5) 


Problem 1.4. Relate the dimensionless redshift to distance expressed in Mpc. 


Let us comment on the traditional unit for the Hubble parameter used in (1.5). 
A naive interpretation of the Hubble law (1.4) is that the redshift is caused by 
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the radial motion of galaxies from the Earth with velocities proportional to the 
distances, 


v=AHor, v<l. (1.6) 


Then the redshift (1.4) is interpreted as the longitudinal Doppler effect (at v < c, 
i.e., v < 1 in natural units, the Doppler shift equals to z = v). According to this 
interpretation, the dimension of the Hubble parameter Hp is [velocity /distance]. We 
stress, however, that the interpretation of the cosmological redshift in terms of the 
Doppler effect is unnecessary, and often inadequate. The right way is to use the 
relation (1.4) as it is. 


Problem 1.5. Consider a system of many particles in Newtonian mechanics. Show 
that it is spatially homogeneous and isotropic if and only if the density of the par- 
ticles is constant over space, and the relative velocity of each pair of particles i and 
j is related to the distance between them by the “Hubble law” 

Vij = Horij, 
where Ho is independent of spatial coordinates. Hereafter, bold face letters denote 


three-vectors, v = (U1, V2, v3). 


The quantity Ho is usually parameterized in the following way, 


(1.7) 


where h is a dimensionless parameter of order one (see (1.5)), 


h = 0.673 + 0.012. 


We use the value h = 0.7 in numerical estimates throughout this book. 


One type of objects used for measuring the Hubble parameter are Cepheids, stars of 
variable brightness whose variability is related to absolute luminosity in a known way. 
This relationship is measured by observing Cepheids in compact systems like Magellanic 
Clouds. Since Cepheids in one and the same system are, to good approximation, at the 
same distance from us, the ratio of their visible brightness to absolute luminosity is the 
same for every star. The periods of Cepheid pulsations range from a day to tens of days, 
and during this period the brightness varies within an order of magnitude. The results 
of observations show that there is indeed a well-defined relation between the period and 
luminosity: the longer the period, the brighter the star. Hence, Cepheids serve as standard 
candles. 

Cepheids are giants and super-giants, so they are visible at large distances from our 
Galaxy. By measuring their spectra, one finds redshift of each of them, and by measur- 
ing the period of pulsations one obtains the absolute luminosity and hence the distance. 
Using these data, one measures the Hubble constant in (1.4). Figure 1.2 shows the Hubble 
diagram — redshift-distance relation — obtained in this way [10]. 

Besides Cepheids, there are other objects used as standard candles. These include, in 
particular, supernovae of type Ia. The determination of the Hubble parameter from the 
observations of remote standard candles is shown in Fig. 1.3. 


8 Cosmology: A Preview 


Hubble Diagram for Cepheids (flow—corrected) 


2000 


\ 
| 


1500 


1000 


Velocity (km/sec) 


0 10 20 
Distance (Mpc) 


wo 
lo) 


Fig. 1.2. Hubble diagram for Cepheids [10]. The solid line shows the Hubble law with the Hubble 
constant Ho = 75km/(s- Mpc), as determined from these observations. The dashed lines show the 
uncertainty in the determination of the Hubble parameter. 
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Fig. 1.3. Hubble diagram for remote standard candles including supernovae of type Ia [10]. The 
solid line shows the Hubble law with the value of the Hubble parameter Ho = 72 km/(s- Mpc) as 
determined from these data. Dashed lines correspond to experimental uncertainty in the Hubble 
parameter. 
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1.2.3. Age of the Universe and size of its observable part 
The Hubble parameter in fact has dimension [t~'], so the present Universe is char- 
acterized by the time scale 
1 1, 1 s-Mpe 
° h 100 km 
and the scale of distances 


= h1- 3.10" s = h™! - 10"? yrs 


Hy,’ = h™' - 3000 Mpc , (1.8) 

which gives, for h = 0.7, 
Hg * = 1.4.10" yrs (1.9a) 
= 4.3 - 10° Mpc. (1.9b) 


Crudely speaking, all distances in the Universe will become twice larger in about 10 
billion years; galaxies at distance of order 3 Gpc from us move away with velocities 
comparable to the speed of light. We will see that the time scale Ho l gives the 
order of magnitude estimate for the age of the Universe, and the distance scale Hy ' 
is roughly the size of its observable part. We will discuss the notions of the age 
and size of the observable part in the course of presentation, and here we point out 
that bold extrapolation of the cosmological evolution back in time (made according 
to the equations of classical General Relativity) leads to the notion of the Big 
Bang, the moment at which the classical evolution begins. Then the age of the 
Universe is the time passed from the Big Bang, and the size of the observable part 
(horizon size) is the distance travelled by signals emitted at the Big Bang and 
moving at the speed of light (more accurate estimate of the horizon size is 15 Mpc). 
We note in passing that the actual size of our Universe is larger, and most probably 
much larger than the horizon size; the spatial size of the Universe may be infinite 
in General Relativity. 


Irrespective of the cosmological data, there exist observational lower bounds on the age of 
the Universe to. Various independent methods give similar bounds at the level 


to > 13billion years = 1.3 - 10'° yrs. (1.10) 


One of these methods makes use of the distribution of luminosities of white dwarfs. White 
dwarfs are compact stars of high density, whose masses are similar to the solar mass. They 
slowly cool down and get dimmer. There are white dwarfs of various luminosities in the 
Galaxy, but the number of them sharply drops off below a certain luminosity. This means 
that there is a maximum age of white dwarfs, which, of course, should be smaller than the 
age of the Universe. This maximum age is found from the energy balance of a white dwarf 
(see, e.g., Ref. [12]). In this way the bound like (1.10) is obtained. 

Other methods include the studies of the radioactive element abundances in the Earth 
core, in meteorites (see, e.g., Ref. [13]), and in the metal-poor* stars (e.g., Ref. [14]), 
the comparison (e.g., Ref. [15]) of the stellar evolution curve for main-sequence stars 


4The term “metals” in astrophysics is used for all elements heavier than helium. 


10 Cosmology: A Preview 


on the Herzsprung—Russel diagram (luminosity-color or brightness-temperature) with the 
abundance of the oldest stars in metal-poor globular clusters,” the analysis of relaxation 
processes in stellar clusters, measurement of the abundance of hot gas in clusters of 
galaxies, etc. 


1.2.4. Spatial flatness 


Homogeneity and isotropy of the Universe do not imply, generally speaking, that at 
each moment of time the 3-dimensional space is Euclidean, i.e., that the Universe has 
zero spatial curvature. Besides the 3-plane (3-dimensional Euclidean space), there 
are two homogeneous and isotropic spaces, 3-sphere (positive spatial curvature) and 
3-hyperboloid (negative curvature). A fundamental observational result of recent 
years is the fact that the spatial curvature of our Universe is very small, if not exactly 
zero. Our 3-dimensional space is thus Euclidean to a very good approximation. We 
will repeatedly get back to this statement, both for quantifying it and for explaining 
which observational data set bounds on the spatial curvature. We only note here 
that the main source of these bounds is the study of the temperature anisotropy of 
the Cosmic Microwave Background (CMB), and that at the qualitative level, these 
bounds mean that the radius of spatial curvature is much greater than the size of 
the observable part of the Universe, i.e., much greater than Hg t, 


We note here that CMB data are also consistent with the trivial spatial topology. If our 
Universe had compact topology (e.g., topology of 3-torus) and its size were of the order 
of the Hubble length, CMB temperature anisotropy would show a certain regular pattern. 
Such a pattern is absent in measured anisotropy, see Ref. [11]. 


1.2.5. “Warm” Universe 


The present Universe is filled with Cosmic Microwave Background (CMB), gas of 
non-interacting photons, which was predicted by the Hot Big Bang theory and 
discovered in 1964. The number density of CMB photons is about 400 per cubic 
centimeter. The energy distribution of these photons has thermal, Planckian spec- 
trum. This is shown in Fig. 1.4 [16]. The CMB temperature is [1] 


To = 2.7255 + 0.0006 K. (1.11) 


The temperature of photons coming from different directions on celestial sphere is 
the same at the level of better than 1074 (modulo dipole component, see below); 
this is yet another evidence for homogeneity and isotropy of the Universe. 

Still, the temperature does depend on the direction in the sky. The angular 
anisotropy of the CMB temperature has been measured, as shown in Fig. 1.5 [38] 
(see Fig. 13.2 on color pages). It is of order ôT /To ~ 1074—1075. 


5Globular clusters are structures of sizes of order 30 pc inside galaxies; they can contain hundreds 
of thousand and even millions of stars. 
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Fig. 1.4. CMB spectrum. The compilation of the data is made in Ref. [16]. The solid line shows 
the Planckian (black body) spectrum. 
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Fig. 1.5. WMAP data [38]: angular anisotropy of CMB temperature, i.e., variation of the tem- 
perature of photons coming from different directions in the sky; see Fig. 13.2 for color version. The 
average temperature and dipole component are subtracted. The observed variation of temperature 
is at the level of ôT ~ 100 pK, i.e., ôT /To ~ 1074—1075. 


We will repeatedly come back to CMB anisotropy and polarization, since, on 
the one hand, they encode a lot of information about the present and early Universe 
and, on the other hand, they can be measured with high precision. 

Let us note that the existence of CMB means that there is a special reference 
frame in our Universe: this is the frame in which the gas of photons is at rest. 
The solar system moves with respect to this frame towards Hydra constellation. 
The velocity of this motion determines the dipole component of the measured CMB 
anisotropy [18], 


ôT dipole = 3.346 mK. (1.12) 
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Problem 1.6. Making use of the value of the dipole component, estimate the veloc- 
ity of motion of the Solar system with respect to CMB. 


Problem 1.7. Estimate the seasonal modulation of the CMB anisotropy caused by 
the motion of the Earth around the Sun. 


The present Universe is transparent to the CMB photons:® their mean free path 
well exceeds the horizon size Hy 1. This was not the case in the early Universe, when 
photons actively interacted with matter. 


Problem 1.8. Greisen—Zatsepin—Kuzmin effect [20, 21]. Interaction of pho- 
ton with proton at sufficiently high energies may lead to the absorption of photon and 
creation of m-meson. Let the cross-section of the latter process in the center-of-mass 
frame be (in fact, this is a pretty reasonable approximation for this problem) 


0 at /s<ma 
o= i 
0.5 mb at ys > ma 


where „/s is the total energy of photon and proton, ma = 1200MeV (A is the 
resonance mass), 1mb = 107? cm?. 

Find the mean free path of a proton in the present Universe with respect to this 
process as a function of proton energy. At what distance from the source does proton 
lose 2/3 of its energy? Ignore all photons (e.g., emitted by stars), except for CMB. 


Since the CMB temperature T depends on the direction n of celestial sphere, it is 
convenient to perform its decomposition over spherical harmonics Yım (n). The latter 
form the basis of functions on a sphere, and the decomposition is the closest analog 
of the Fourier decomposition. The temperature fluctuation ôT in the direction n is 
conveniently defined as 


ôT (n) = T(n) _ To = ôTdipole 
and its decomposition is 


ôT (n) = 5 lm Yim), 
l,m 


where the coefficients @j,m obey af m = (—1)'™ai,-m, So that temperature is real. The 
multipoles l correspond to fluctuations of characteristic angular size m/l. The current 
measurements are capable of studying angular scales ranging from the largest ones 
to less than 0.1° (1 ~ 1000, see Fig. 1.6 [39]). 


6This is not completely true in some regions of the Universe. As an example, photons scatter 
off hot gas (T ~ 10 keV) in clusters of galaxies and gain some energy. Thus, CMB is warmer in 
the directions towards clusters. This is called the Sunyaev—Zeldovich effect [19]. It is small but is 
measured in observations. 
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Fig. 1.6. CMB angular anisotropy as measured by Planck experiment [39]. The theoretical curve 
is obtained within the ACDM model (see Chapter 4); finite width of this curve (shadow) illustrates 
cosmic variance, which is due to the fact that only one (our) Universe is observed. 


The observational data are consistent with the property that temperature fluc- 
tuations ôT (n) are Gaussian random field, i.e., that the coefficients aj. are statis- 
tically independent for different | and m, 


(alma m) = Crm Omm’ 5 (1.13) 


where brackets mean averaging over an ensemble of Universes like ours. The coef- 
ficients Cj, do not depend on m in isotropic Universe, Cim = C1. They determine 
the correlation of temperature fluctuations in different directions, 


(6T(n1)5T(m2)) = X` ZEL c1Pi(cos 0), 
l 


where P, are the Legendre polynomials, functions of the angle 0 between the vectors 
nı and nə. In particular, the temperature fluctuation is 


2141 U(1 +1) 
2 = wy l L. 
(ôT?) 3 y^ / i din 


Thus, the quantity Dı = Do, determines the contribution to the fluctuation of 
a decimal interval of multipoles. It is this quantity that is shown in Fig. 1.6. 

It is important that the measurement of the CMB anisotropy gives not just a 
number, but a large set of data, the values of C; for different l. This set is determined 
by numerous parameters of the present and early Universe, hence its measurement 
provides a lot of cosmological information. Additional information comes from the 
measurement of CMB polarization. 
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1.3. Energy Balance in the Present Universe 


A dimensional estimate of the energy density in the Universe may be obtained 
in the following way. Given the energy density po, the density of “gravitational 
charge” is of order Go. Since the dynamics of the Universe is governed by gravity, 
the “charge” Gpo must somehow be related to the present expansion rate. The 
“charge” has dimension of M?; the same dimension as Hj. This suggests that po ~ 
HéG~' = M?2_,H3. Indeed, we will see that the present energy density in a spatially 
flat Universe is given by 


3 
pe = — H? M2.. 
8m 
With precision better than 1%, this is the energy density in our Universe today.” 
Numerically 


pe = 1.88 - 10729? È 5 = 0.53- 1075 S at h=0.7. (1.14) 
m 


According to the data of cosmological observations which we will discuss in due 
course, the contribution of baryons (protons, nuclei) into the total present energy 
density isë about 4.6%, 


Qs = = = 0.046. 
Pe 
The contribution of relic neutrinos of all types is even smaller; the cosmological 
bound is 


= È Pri < 0.0055, (1.15) 
Pe 


where the sum runs over the three species of neutrinos Ve, Vu, Vr and anti-neutrinos 
De, Du, Dr. We emphasize that there is still no cosmological evidence for the neu- 
trino mass; it is rather likely that the neutrino contribution is quite a bit smaller 
than the right-hand side of Eq. (1.15). Other known stable particles give negligible 
contribution to the present total energy density. Thus, the dominating material in 
the present Universe is something unknown. 

This “something unknown” in fact consists of two fractions, one of which is 
capable of clustering, and another is not. The former component is called “dark 
matter”. Its contribution to the energy density is about 25%. 

We will discuss the results (Big Bang Nucleosynthesis, CMB anisotropy, 
structure formation) which show that dark matter cannot consist of known particles. 
Most probably it is made of new stable particles which were non-relativistic in very 
distant past and remain non-relativistic today (cold, or possibly warm dark matter). 
This is one of a few experimental evidences for New Physics beyond the Standard 


This 1% has to do with observationally allowed effect of spatial curvature. 
8Note that only 10% of baryons are in stars. Most of baryons are in hot gas. 
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Model of particle physics. Direct detection of dark matter particles is an extremely 
important, and yet unsolved problem of particle physics. 

According to current viewpoint, the rest of energy in the present Universe, 
about 70%, is homogeneously spread over space. This is not matter consisting of 
some unknown particles, but rather a unconventional form of energy of vacuum 
type. It is called by different names: dark energy, vacuum-like matter, quintessence, 
cosmological constant, A-term. We will use the term “dark energy” and will use 
the terms “quintessence” and “cosmological constant” for dark energy with specific 
properties: in the case of cosmological constant the energy density does not depend 
on time, while for quintessence weak dependence, instead, exists. 

It is not excluded that observational data which are quoted as showing the 
presence of dark energy can be explained in an alternative way. One possibility is 
that gravity deviates from General Relativity at cosmological distance and time 
scales. There is theoretical activity in the latter direction indeed, but it is out of 
the scope of this book to discuss it in any detail. We will assume throughout that 
gravitational interactions are described by General Relativity. 

We will further discuss dark energy and observations leading to this notion in due 
course. Here we mention the property that unlike the energy (mass) density of non- 
relativistic particles which decays as a~3(t) as the Universe expands, dark energy 
density either does not depend on time at all, or depends on time very weakly. Hence, 
at some stage of the cosmological evolution dark energy starts to dominate. The 
transition from matter dominated to dark energy dominated expansion occurred in 
our Universe at z ~ 0.5. 


Density of baryons and dark matter in clusters of galaxies is determined by various methods 
of measurement of the gravitational potential, i.e., total mass distribution. As an example, 
the left panel of Fig. 1.7 (see Fig. 13.4 on color pages) shows mass distribution in a 


Fig. 1.7. Cluster of galaxies CL0024 + 1654 [22]; see Fig. 13.4 for color version. 
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cluster, obtained by the method of gravitational lensing [22]. The idea is that light rays 
from galaxies residing behind the cluster get bent by the gravitational field of the cluster. 
This gives rise to multiple, distorted images of the distant galaxies® (see the right panel of 
Fig. 1.7). Hence, this method enables one to measure the gravitational potential in a cluster 
irrespectively of the sort of matter producing it. The result is that visible matter, whose 
density can be determined independently, makes rather small fraction of total mass; most 
of the mass is due to dark matter. The latter is clustered; its density is inhomogeneous 
over the Universe. Assuming that the ratio of dark and visible mass in the Universe is 
the same as in clusters of galaxies,'° one finds that the mass density of baryons and dark 
matter together constitutes about 30% of the total energy, 


pu © 0.3pe- (1.16) 


Gravitational lensing is by no means the only way to measure the gravitational poten- 
tial in clusters of galaxies. In particular, X-ray observations show that most of baryons in 
clusters are in hot, ionized intergalactic gas. The total mass of baryons in the gas exceeds 
the mass of baryons in luminous matter by an order of magnitude. As a matter of fact, 
X-rays are produced by electrons. So, the observations give the spatial distributions of 
their number density ne(r) and temperature T(r). Assuming spherical symmetry, one 
obtains the spatial distribution of total mass density p(r) from the equation of hydrostatic 
equilibrium, 


R 
= = -une (R)m, ZAP, M(R) = 4r I p(r)r?dr, (1.17) 
where une(R) is the number density of baryons in the gas, and yz is determined by its chem- 
ical composition (the gas is electrically neutral, so the numbers of baryons and electrons 
are the same up to factor u). The pressure P is mostly due to electrons, and it is related 
to temperature in the usual way, P(R) = ne(R)T-(R). All quantities in (1.17) except for 
M(R) are obtained from observations, so the mass M(R) is uniquely determined. Again, 
this method gives the results consistent with (1.16); see, e.g., Ref. [23]. 

The same conclusion follows from the study of the motion of galaxies and their groups 
in clusters. Assuming that the relaxation processes for galaxies have been over, one makes 
use of the virial theorem to infer the mass of a cluster, 


2 
3M (vz) = a. (1.18) 


Here M and R are the mass and size of a cluster, and (v2 ) 1/2 is the dispersion of projections 
of velocities on the line of sight. The latter is determined by the analysis of the Doppler 
effect in the spectra of either entire cluster or individual galaxies. The masses obtained 
from the virial theorem (1.18) by far exceed the sum of masses of individual galaxies in 
clusters (even including dark galactic haloes); this means that most of the mass is due to 
dark matter which is distributed smoothly over the cluster. 

It is known for a long time that nearby galaxy groups and clusters move towards 
Virgo constellation. Assuming that this motion is due to the gravitational attraction by 
the central cluster of galaxies, one estimates its mass. This estimate again shows that the 
mass of galaxies is too small, so this Virgo cluster contains dark matter. 


°This is called strong lensing, as opposed to weak lensing which only affects the intensity of light. 
10This is not an innocent assumption, since most of galaxies are not in clusters; conversely, clusters 
host only about 10% of galaxies and probably about 10% of dark matter. 
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Fig. 1.8. Observation of “Bullet cluster” 1E0657-558, two collided clusters of galaxies [24]; see 
Fig. 13.5 for color version. Lines show gravitational equipotential surfaces, the bright regions in 
the right panel are regions of hot baryon gas. 


A particularly convincing argument for dark matter in clusters of galaxies comes from 
the observation of clusters just after their collision. The result [24] is shown in Fig. 1.8 
(see Fig. 13.5 on color pages). Bright colors in the right panel show the distribution of 
hot gas whose X-ray emission has been observed by Chandra telescope. This gas contains 
about 90% of all baryons in both clusters, while galaxies add the remaining 10%. The 
gravitational potential is measured via gravitational lensing; the distribution of galaxies 
follows the gravitational potential. It is clear that the gravitational potential is not at all 
produced by baryons; rather, its source is dark matter. Dark matter and galaxies here 
are collisionless: they passed through each other and move away from each other with the 
original speed. Hot gas loses its velocity due to collisions, and it lags behind dark matter. 

Dark matter also explains the rotational velocities of stars, gas clouds, globular clusters 
and satellite dwarf galaxies at the periphery of galaxies. Under the assumption of circular 
motion, the dependence of velocity v(R) on distance R from the galactic center follows 
from Newton’s law, 


R 
v(R) = = M(R) = anf p(r)r’dr, 


0) 


where p(r) is the mass density. Observationally, v( R) = constant sufficiently far away from 
the center, see Fig. 1.9 [25], while the contribution of luminous matter to p would lead to 
v(R) x 1/VR. This is explained by assuming that luminous matter is embedded into dark 
cloud of larger size — galactic halo. 

Clustered dark matter could in principle consist of the usual particles, baryons and 
electrons, if they were contained in dark objects like neutron stars and brown dwarfs. These 
are dim, dense objects of small sizes. To cope with observations, the latter should be present 
not only in the disc of our Galaxy but also in the halo. Similar distribution should be 
characteristic to other galaxies. The density of these objects can be found observationally. 
Moving across the line of sight between the Earth and a star (say, from a nearby dwarf 
galaxy) these objects would serve as gravitational lenses. Some evidence of lensing of 
this type (microlensing) has indeed been observed [26], but it is by far not as frequent 
as necessary for explaining the whole of dark matter in the halo [26, 27]. Furthermore, 
many of the candidate compact objects are not suitable for other reasons. As an example, 
neutron stars are remnants of supernovae. The latter are main sources of oxygen, silicon 
and other heavy elements. The abundances of these elements in galaxies are well known. 
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Fig. 1.9. Rotation curve of a galaxy NGS 6503 [25]. Different curves show the contributions of 
three major components of matter to the gravitational potential. 


In this way the number of supernova explosions, and hence neutron stars, is estimated, 
and this number is definitely too small. 

There are several observational results showing the existence of dark energy. We already 
mentioned that the Universe is spatially flat to high precision: the present total energy 
density coincides with the critical density pe to better than 1%. On the other hand, the 
estimate for the density of clustered energy is given by (1.16), which is considerably smaller 
than pe. The rest of the energy density is attributed to dark energy. 

An independent argument for dark energy is as follows. We will see that the expansion 
rate of the Universe now and in the past depends on its energy content. The expansion 
rate determines in turn the relationship between redshift and visible brightness for distant 
“standard candles”. This relationship has been measured by making use of supernovae Ia 
(SNe Ia) which have been argued to have the properties of standard candles.'! Already the 
first data [28, 29] have shown that distant supernovae are relatively dimmer than nearby 
ones. This is interpreted as evidence for the accelerated expansion of the Universe today 
and in recent past. In General Relativity, accelerated expansion is possible only if there is 
dark energy whose density weakly depends on time (or does not depend on time at all). 

There are several other, independent arguments based, in particular, on the estimate 
of the age of the Universe, structure formation, cluster abundance, CMB anisotropy. All 
of them point to the existence of dark energy whose density today is at the level of 0.7pe. 
The hope is that future observations will shed light on the nature and properties of this 
energy component in our Universe. 


One of the candidates to dark energy is vacuum. Particle physics theories often 
ignore vacuum energy, as it serves merely as a reference point for the energy, while 


11Nearby SNe Ia show empirical relation between the absolute luminosity at maximum brightness 
and the time behavior of the light emission: brighter supernovae shine longer. Assuming the same 
relation for distant supernovae, their absolute luminosities are inferred from the durations of their 
bursts. 
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one is interested in masses and energies of particles — excitations about vacuum. 
Completely different situation occurs in General Relativity: vacuum energy, like any 
other form of energy, gravitates. If gravitational fields are not extremely strong, vac- 
uum is the same everywhere anytime, so its energy density is constant in space and 
time. Hence, vacuum energy does not cluster, so it is indeed an excellent candidate 
for dark energy. The problem, though, is that the energy density has dimension 
M*, and one would expect offhand that the value of vacuum energy density would 
be of the order of the fourth power of the mass scale of fundamental interactions. 
These scales are 1 GeV for strong interactions, 100 GeV for electroweak interactions 
and Mp, ~ 101° GeV for gravitational interactions themselves. Thus, one would 
estimate the corresponding contributions to vacuum energy as follows, 


Piae ~ 1 GeVź, for strong interactions 
~ 108 GeV, for electroweak interactions 
~ 10 GeV‘, for gravitational interactions. (1.19) 


Any of these estimates exceeds by many orders of magnitude the actual dark energy 
density 


pa ~ pe ~ 107° = ~ 10746 Gev*. (1.20) 
cm 
This is a serious problem for theoretical physics which is dubbed cosmological con- 
stant problem: it is a mystery that the vacuum energy density is so small compared 
to the estimates (1.19), and the even greater mystery is that it is different from 
zero (if dark energy is vacuum energy indeed). Without exaggeration, this is one of 
the major problems, if not the major problem, of fundamental physics. There are 
many puzzles and coincidences here, which require fine tuning of parameters of dif- 
ferent nature. Too large vacuum energy density (but still many orders of magnitude 
below the particle physics scales) would be incompatible with our existence: large 
and positive vacuum energy would lead to very fast cosmological expansion totally 
suppressing formation of galaxies and stars, while the Universe with large and neg- 
ative vacuum energy would recollapse long before any structure would form. A 
coincidence calling for explanation is that the three different energy components — 
dark energy, dark matter and baryons — are of the same order of magnitude in 
the present Universe (“Why now?” ). These components have different origins, so a 
priori they would give contributions of different orders of magnitude. 
Let us stress that vacuum is by no means the only dark energy candidate dis- 
cussed in literature; we will consider some other candidates later on in this book. 


1.4. Future of the Universe 


The future of the Universe is mostly determined by its geometry and the properties 
of dark energy. 
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We will see that the contribution of the spatial curvature into effective energy 
density is inversely proportional to the scale factor squared, a~. So, if spatial cur- 
vature is non-zero, it will sooner or later start to dominate over energy density of 
non-relativistic matter which decays like a7. 

Hence, in the long run, the competition will be between the spatial curvature 
and dark energy. If the latter is time-dependent and will relax to zero sufficiently 
rapidly in future, then the expansion of Universe with positive curvature (closed 
model) will slow down, then terminate, and eventually the Universe will recollapse 
to singularity. The Universe with negative spatial curvature will expand forever, 
though its expansion will slow down. Clusters of galaxies will move away from each 
other to larger and larger distances. The same would happen to galaxies not bound 
to clusters. All systems that are not gravitationally bound will disappear. The same 
properties hold for spatially flat Universe with dark energy relaxed to zero (but the 
expansion in that case will be even slower). 

If dark energy density does not depend on time, like in the case of vacuum 
energy,!? or depends on time weakly, then the dominant player will be dark energy. 
Positive dark energy will lead to exponential expansion; the Universe will expand 
forever with (almost) constant acceleration. 

One cannot exclude also the possibility that the dark energy will become negative 
in distant future. In that case the dark energy will slow the expansion down, and 
in the end the Universe will recollapse to singularity. 

We stress that it is in principle impossible to predict the ultimate fate of the 
Universe on the basis of cosmological observations only. These observations enable 
one, generally speaking, to figure out the dependence (or independence) of dark 
energy density on time in the past, but the behavior of dark energy in the future 
can only be hypothesized. To predict the distant future of our Universe, one would 
need to know the nature of dark energy (or, more generally, the precise reason 
for the accelerated expansion of the Universe at present). Whether and how such 
a knowledge can be obtained is hard to tell. Nevertheless, one can extrapolate, 
with reasonable confidence, the future evolution of the Universe in 10-20 billion 
years. During this time, the Universe will expand at rate comparable to the present 
Hubble rate. 


Yet another possibility discussed in literature is that the dark energy density will grow 
in future. If this growth is sufficiently fast, the Universe will end up in Big Rip: in 
finite time the scale factor will become infinite, interactions between particles will be 
insufficient to keep them in bound states, and all bound systems, including atoms and 
nuclei, will disintegrate and point-like particles will move away from each other to infinite 
distance. 


12We do not discuss here the possibility of the cosmological phase transition which would change 
the energy balance and thus have major effect on the evolution of the Universe. 
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1.5. Universe in the Past 


The very fact that the Universe expands clearly implies that it was denser and 
warmer in the past. On the basis of General Relativity and standard thermody- 
namics we will see that matter had higher and higher temperature and density at 
earlier and earlier epochs, and that at most stages it was in thermal equilibrium. 
Hot Big Bang theory is precisely the theory of such a Universe. Going back in 
time, and, accordingly, up in temperature, we find a number of particularly impor- 
tant “moments” (better to say, more or less lengthy periods) in the cosmological 
evolution; see Fig. 1.10. Let us briefly discuss some of them. 


1.5.1. Recombination 


At relatively low temperatures the usual matter in the Universe was in the state of 
neutral gas (mainly hydrogen). At an earlier stage, i.e., at higher temperatures, the 
binding energy was insufficient for keeping electrons in atoms, and the matter was in 
the state of baryon—electron—photon plasma. The temperature of recombination — 
the transition from plasma to gas — is determined, very crudely speaking, by the 
binding energy in hydrogen atom, 13.6eV. We will see, however, that recombination 
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Fig. 1.10. Stages of the evolution of the Universe. 
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occurred at a somewhat lower temperature, T ~ 0.26eV. This is a very important 
epoch: before it photons actively scattered off electrons in the plasma (and at even 
earlier times they were emitted and absorbed), while after recombination the neutral 
gas was transparent to photons. Hence, the CMB that we see today comes from the 
recombination epoch, and it carries information about the properties of the Universe 
at the epoch when its temperature was about 0.26 eV ~ 3000K and age about 370 
thousand years. 

We have already pointed out that the high degree of CMB isotropy shows 
that the Universe was highly homogeneous at recombination, the density perturba- 
tions dp/p were comparable to temperature fluctuations and were roughly of order 
1074—1075. Nevertheless, these perturbations in the end have given rise to struc- 
tures: first stars, then galaxies, then clusters of galaxies. 


In fact, the optical depth (scattering probability) for photons after recombination is differ- 
ent from zero and is equal to r ~ 0.08—0.10. The reason is the reionization in the Universe 
which begins at the time of active formation of the first stars, z < 20. 

The fact that hydrogen in the Universe is almost completely ionized (n#/np < 1075) 
at z <6, has been known for a long time from the observation of hydrogen emission lines 
of quasars: if the light from quasars traveled through hydrogen, it would get completely 
absorbed. Indeed, even though the light from quasars gets redshifted, there would be 
enough hydrogen atoms with appropriate velocities to absorb it. 

Evidence for the early reionization, z ~ 10, comes from CMB anisotropy and polar- 
ization. In particular, the polarization at large angular scales is due to scattering of relic 
photons off free electrons at that time. The number of electrons required to explain the 
optical depth 7 ~ 0.08—0.10 corresponds to complete ionization of cosmic hydrogen at 
z ~ 10—12 (or partial ionization at a somewhat earlier time). 


1.5.2. Big Bang Nucleosynthesis (BBN) 


Another important epoch in the cosmological evolution occurs at much higher tem- 
peratures, whose order of magnitude is determined, crudely speaking, by the scale 
of binding energies in nuclei, i.e., 1-10 MeV. Again, the actual temperatures are 
somewhat smaller; the reason is discussed in Chapter 8. In any case, at high tem- 
peratures protons and neutrons were free in cosmic plasma, but after the Universe 
cooled down due to expansion, neutrons have been captured into nuclei. As a result, 
besides hydrogen, there are light nuclei in primordial plasma: mostly helium-4 (the 
most tightly-bound light nucleus) and also a small amount of deuterium, helium-3 
and lithium-7; heavier elements were not synthesized in the early Universe.!* This 
epoch of Big Bang Nucleosynthesis (BBN) is the earliest epoch studied directly so 


13Heavy elements are produced during stellar evolution. In particular, one of the important ele- 
ments in the nucleosynthesis chain, carbon, is synthesized in the fusion of three *He-nuclei. This 
process is possible only at very high densities reached in stellar interiors after hydrogen has been 
burned out. All other elements are synthesized from carbon. Relatively light elements, includ- 
ing iron, are produced in thermonuclear reactions in stars; heavier elements are synthesized as 
a result of neutron capture in stars and supernova explosions, and some elements are produced, 
presumably, in the processes of proton or positron capture. 
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far: the calculation of the light element abundances makes use of General Relativ- 
ity and known microscopic physics (physics of nuclei and weak interactions), while 
measurements of these primordial abundances, though difficult, are quite precise by 
now. 

Good agreement between BBN theory and observations is one of the cornerstones 
of the theory of the early Universe. Let us stress that BBN epoch lasted from about 
1 to 300 seconds after the Big Bang, the relevant temperatures range from 1 MeV 
to 50keV. 


The difficulty in measuring primordial element abundances is that most of the baryonic 
material in the Universe has been reprocessed in stars, and its composition is different 
from that of primordial plasma. Nevertheless, it is possible to find places in the Universe 
where matter can be claimed, with good confidence, to have not been reprocessed, and its 
composition coincides with the primordial composition. 


1.5.3. Neutrino decoupling 


While photons last scattered at temperature 0.26 eV, neutrino interactions with cos- 
mic plasma, as we will see, terminated at temperature 2-3 MeV. Before that, neutri- 
nos were in thermal equilibrium with the rest of matter, and after that, they freely 
propagate through the Universe. We will calculate the temperature and number 
density of neutrinos, and here we note that they are of the same order of magnitude 
as the temperature and number density of photons. Unfortunately, direct detection 
of relic neutrinos is a very difficult, and may even be an unsolvable problem. 


Problem 1.9. If there exist neutrinos of ultra-high energies in Nature, they may 
scatter off relic neutrinos. The neutrino-neutrino cross-section in the Standard 
Model of particle physics is very small. Its maximum value, opp = 0.15 ub = 
1.5 -10731 em?, is reached at center-of-mass energy \/s ~% Mz ~ 90 GeV when neu- 
trinos pair-annthilate through the resonant Z-boson production. The observation of 
photons produced in Z-decay chains would be indirect evidence for the existence of 
relic neutrinos. 

Find the mean free path of an ultra-high energy neutrino in the present Universe 
with respect to the above process. Does one expect a cutoff in the spectrum of ultra- 
high energy neutrinos, similar to the GZK cutoff in the spectrum of ultra-high energy 
protons (see Problem 1.8)? 


The role of neutrinos in the present Universe is not particularly important. 
However, the neutrino density in the early Universe is an important parameter 
of BBN theory. Primordial nucleosynthesis occurs in expanding Universe, and the 
neutrino component affects the expansion rate and hence cooling rate of plasma 
at the time of BBN. The latter is important for inequilibrium processes of the 
light element synthesis. The success of BBN theory gives decisive evidence for the 
existence of relic neutrinos. 
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Relic neutrinos play a role in structure formation; they affect CMB angular 
spectrum too. A combination of cosmological data yields the limits on the sum of 
masses of all neutrino cpecies which are in the range [94, 99] 


5 my < 0.2 +0.5 eV. 


Note that these limits are stronger than direct experimental bound. It is not unlikely 
that cosmological methods will actually provide the determination of the neutrino 
masses. 

In this and the accompanying book, we will study relic neutrinos and their effects 
in some details. 


1.5.4. Cosmological phase transitions 


Going further back in time, we come to the epochs which have not been directly 
probed by observations so far. Hence, we have to make more or less reasonable 
extrapolations. It is likely that the history of the hot Universe goes back to temper- 
atures of the order of hundreds GeV and quite possibly to even higher temperatures. 
At so high temperatures there are epochs of interest, at least from the theoretical 
viewpoint. Some of them can be loosely called epochs of phase transitions. 

— Transition from quark-gluon matter to hadronic matter. Its temperature; 
is determined by the energy scale of strong interactions and is about 200 MeV. At 
much higher temperatures quarks and gluons behave as individual particles (rather 
strongly interacting towards the transition epoch), while at lower temperatures they 
are confined in colorless bound states, hadrons. At the same time (or almost the 
same time) there was the transition associated with chiral symmetry breaking. 

— Electroweak transition [30-33]. Simplifying the situation we can say that at 
temperatures above 100 GeV (energy scale of electroweak interactions), the Englert— 
Brout—Higgs condensate is absent, and W- and Z-bosons have zero masses. The 
present phase with broken electroweak symmetry, Higgs condensate and massive 
W- and Z-bosons is the result of the electroweak transition! that occurred at 
temperature of order 100 GeV. 

— Grand Unified transition. There are hints towards Grand Unification, the 
hypothesis that at energies and temperatures above 1016 GeV there is no distinction 
between strong, weak and electromagnetic interactions: these interactions are unified 
into a single force. If so, and if these temperatures existed in the Universe, then at 
the temperature of Grand Unification Teyr ~ 1018 GeV there was the corresponding 
phase transition. We note, however, that maximum temperature in the Universe may 
well be below Toy, so the phase of Grand Unification may not exist in the early 


M4This is likely a smooth crossover rather than phase transition proper. 

15Tn fact, the situation is somewhat more complicated: an order parameter is absent in electroweak 
theory (at least within the Standard Model of particle physics), so the phase transition can be 
absent. Indeed, given the experimental bound on the Higgs boson mass, there is smooth crossover 
in the Standard Model instead of the phase transition. 
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Universe. This is the case in many models of inflation: there, the reheat temperature 
is lower than Toy. 


1.5.5. Generation of baryon asymmetry 


The present Universe contains baryons (protons, neutrons) and practically no 
antibaryons. Quantitative measure of the baryon abundance is the ratio of baryon 
and photon number densities. The studies of BBN and CMB give 


1s = 2 = 6.05: 1071 (1.21) 
Y 


with precision of about 1%. The baryon number is conserved at sufficiently low 
energies and temperatures, and we will see that in the early Universe n,/ny is of 
the same order of magnitude as given in (1.21). Thus, baryon asymmetry ns is one 
of the most important parameters of cosmology. 

At temperatures of hundreds of MeV and higher, there were a lot of quarks and 
antiquarks in the cosmic plasma, which were continuously pair-created and anni- 
hilated. Thus, unlike in the present Universe, there were almost as many particles 
with negative baryon number (antiquarks) as those with positive baryon number 
(quarks). Simple thermodynamical arguments, to be given in this book, show that 
the number of quark—antiquark pairs at high temperatures is about the same as the 
number of photons, so the baryon asymmetry can be understood as determining 
the following ratio 

PaT xg ~ 107%, (1.22) 

Ng + Ng 
Here nq and ng are number densities of quarks and antiquarks, respectively. We see 
that in the early Universe, there existed one uncompensated quark per 10 billion 
of quark—antiquark pairs. It is this tiny excess that is responsible for the existence 
of baryonic matter in the present Universe: as the Universe expanded and cooled 
down, antiquarks annihilated with quarks, while uncompensated quarks remained 
there and in the end formed protons and neutrons. 

One of the problems of cosmology is to explain the very existence of the baryon 
asymmetry [34, 35], as well as to understand its value (1.21). It is extremely implau- 
sible that the small excess of quarks over antiquarks (1.22) existed in the Universe 
from the very beginning, i.e., that it is one of the initial data of the cosmological 
evolution; it is much more reasonable to think that the Universe “in the begin- 
ning” was baryon-symmetric. The same conclusion comes from inflationary theory. 
Asymmetry (1.22) was generated in the course of the cosmological evolution due to 
processes with baryon number non-conservation. We will discuss possible mecha- 
nisms of generation of this asymmetry, but we stress right away that today there is 
no unique answer to the question of its origin. Here we note only that baryon asym- 
metry was generated most probably at very high temperatures, at least 100 GeV and 
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maybe much higher, although its generation at lower temperatures is not completely 
excluded. 

The problem of the baryon asymmetry cannot be solved within the Standard 
Model of particle physics. This is another cosmological hint towards New Physics 
beyond the Standard Model. 


1.5.6. Generation of dark matter 


Which particles make non-baryonic clustered dark matter is not known experimen- 
tally. One expects that these are stable or almost stable particles that do not exist 
in the Standard Model of particle physics. Hence, the very existence of dark matter 
is a very strong argument for incompleteness of the Standard Model. This makes 
the detection and experimental study of the dark matter particle extremely inter- 
esting and important. On the other hand, the lack of experimental information on 
the properties of these particles makes it impossible to give a unique answer to the 
question of the mechanism of the dark matter generation in the early Universe. We 
will discuss various dark matter particle candidates in this book, and here we make 
one remark. We will see that hypothetical stable particles of mass in the GeV— 
TeV range, whose annihilation cross-section is comparable to weak cross-sections, 
do not completely annihilate in the course of the cosmological evolution, and that 
their resulting mass density in the present Universe is naturally of the order of the 
critical density pe. Therefore, these particles are natural dark matter candidates, 
especially because they exist in some extensions of the Standard Model, including 
supersymmetric extensions. 

Particles which we have briefly described are called WIMPs (weakly interacting 
massive particles). Their freeze-out, i.e., termination of annihilation, occurred at a 
temperature somewhat below their mass, i.e., T ~ 1—100 GeV. 

Of course, there are many other dark matter particle candidates besides WIMPs. 
These include axion, gravitino, sterile neutrino, etc. Some of them are discussed in 
this book. 


1.6. Structure Formation in the Universe 


We discussed in previous Sections the most important stages of the cosmological 
evolution. Each of them, be it nucleosynthesis or recombination, has finite time 
duration. There is, however, a process in the Universe that began at a very early 
epoch and continues at present. This process is formation of structures — first 
stars, galaxies, clusters of galaxies, superclusters. The order we put them here is 
not random: smaller objects get formed earlier. 

The theory describing structure formation is based on the Jeans instability, the 
gravitational instability of matter density perturbations. It should be assumed, of 
course, that the perturbations have already existed at the very early stage of the 
cosmological evolution, even though they were very small in amplitude. 
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The source of primordial perturbations is not particularly relevant to the theory 
of structure formation. Rapid growth of density perturbations occurs at that stage 
of the cosmological expansion when the dominating energy component is mass of 
non-relativistic matter. Transition to this stage happened 50 thousand years after 
Big Bang; before that the Universe was so hot that the dominant component was rel- 
ativistic (“radiation”). The epoch of this transition is called radiation—matter equal- 
ity. At that time the density perturbations had small amplitude, dp/p ~ 1074—1075. 
Regions of higher density are sources of gravitational potential, they attract sur- 
rounding matter, and the density in these regions becomes even higher. This is 
precisely the physical reason for gravitational instability.!° Once the overdensity is 
large enough, the overdense region becomes gravitationally bound and starts living 
its own life; in particular, the size of this region does not grow despite the expan- 
sion of the Universe. Instead, gravitational interaction within this region leads to its 
collapse to an object of much smaller size. In this way protostars and protogalaxies 
get formed. First stars were formed at z ~ 10 and somewhat earlier. Active galaxy 
formation occured at redshifts 2—4, though some galaxies were formed much earlier: 
there is evidence for galaxies at redshifts of order 10. 

The mass of an object — galaxy, cluster of galaxies — is determined by the 
size of the primordial overdense region. So, the number densities of galaxies and 
clusters and their mass distribution reflect the spectrum of primordial perturbations. 
Existing observational data on structures are consistent with the simplest “flat” 
primordial spectrum, called Harrison—Zeldovich spectrum. The defining property of 
the latter is its scale invariance: in a certain sense, perturbations of different sizes 
have one and the same amplitude. 

Perturbations existing in cosmic medium at recombination give rise to CMB 
temperature anisotropy and polarization. Hence, the primordial spectrum can be 
determined also from the CMB observations. Notably, the spectra found from CMB 
and structures are in good agreement with each other. 

Structure formation gives yet another argument for the presence of dark matter: 
without dark matter, density perturbations would start to grow after recombination 
only, and by now they would not have developed into structures yet. Furthermore, 
it follows from the theory of structure formation that the major part of dark matter 
must be cold (or warm), i.e., it must consist of particles that became non-relativistic 
at a very early epoch. If a large fraction of dark matter were hot, i.e., consisted 
of particles remaining relativistic until late times, then the formation of objects 
of relatively small size would be suppressed, in contradiction to observations. An 
important example of hot dark matter is the neutrino of mass m, ~ 1-107 eV. 
This suggests that cosmology is capable of setting bounds on neutrino masses. 

We discuss in the accompanying book the structure formation and the role played 
in it by different components of cosmic matter. 


16This mechanism does not work for relativistic particles, since weak gravitational field cannot 
keep them inside an overdense region. 
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1.7. Before the Hot Epoch 


1.7.1. Argument from observations 


Cosmological observations unambiguously suggest that the hot epoch of the cos- 
mological evolution discussed in Sec. 1.5 was preceded by some other epoch with 
quite unusual properties. We discuss this issue in detail in the accompanying book, 
and here we point out that this remarkable conclusion follows from the analysis of 
matter perturbations — the perturbations which in the end have become galax- 
ies and other structures. The perturbations in baryon—electron—photon component 
before recombination were nothing but acoustic waves!’; we will see that the sound 
speed in this medium u, ~ 1/ /3. The wavelength of each of the acoustic wave 
increases due to the expansion of the Universe, A(t) « a(t), while the wave vector 
(momentum) q(t) = 27/X(t) decreases, 


where k is a time-independent quantity called conformal momentum. The frequency 
of each wave also decreases, 


w(t) = usq(t). 


The time-dependence of the density contrast of the baryon—electron—photon com- 
ponent for a mode of a definite wave vector k has oscillatory behavior (with quali- 
fications which we do not specify here) 


? 


t 
PET i) = Ax cos / ae a+ on, (1.23) 
PBy o a(t) 


where A, is the amplitude of the wave, px is a time-independent phase, and notation 
By refers to baryon-electron—photon component. 

This oscillatory behavior was not characteristic of the acoustic waves at all 
times. At the radiation domination epoch, the scale factor increases as a(t) « vt, 
whereas the Hubble paramter equals H(t) = (2t)~!. Therefore, at early times the 
Hubble parameter was larger than frequency, H(t) > w(t), i.e., the expansion was 
faster than the oscillations, and the oscillations simply had no time to occur. This 
property can be also phrased in the following way. Suppose that the cosmological 
evolution began right from the hot epoch. The signals emitted at the Big Bang 
and propagating with the speed of light would travel by time t to distance ly (t) ~ 
t~ H~'(t). This would be a maximum size of causally connected region at time t, 
i.e., the cosmologcal horizon size. Since for given k the wavelength X(t) grows like 


17 There were density perturbations in dark matter too, but this is not particularly important here. 
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a(t) x vt, we have for small t 
A(t) >> l(t). 


This means that for each wave there was an epoch when its wavelength exceeded 
the horizon size! In such a situation the perturbation is called superhorizon. We 
note that the perturbations responsible for galaxies, and even more so for clusters 
of galaxies, were superhorizon until rather late times: their horizon entrance (time 
at which A(t) ~ la(t)) occured at fairly low temperatures, T < 100 keV, i.e., after 
the nucleosynthesis epoch. So, there is no guesswork at this point. 

It is highly implausible that the primordial perturbations were built in as an 
initial condition of the cosmological evolution. It is natural to think, instead, that 
they were generated by some physical processes. Then causality guarantees that 
this generation cannot happen at early hot epoch when the perturbation wave- 
length exceeds the cosmological horizon size. Thus, there are two options: either 
the perturbations were generated relatively late at the hot epoch (when their wave- 
lengths were smaller than the horizon size) or the hot epoch was preceded by some 
other epoch — the epoch responsible for the generation of perturbations. 

It is remarkable that the cosmological data discriminate between the two options: 
the data tell that the perturbations were actually generated at some pre-hot epoch. 
The point is that if the perturbations existed already at early hot epoch when they 
were superhorizon (and hence they were generated before the hot epoch), the phase 
of oscillations in (1.23) is uniquely determined.!® For example, this phase equals 
zero for not very long waves, Yk = 0; this is the property of solutions to the wave 
equations in the Universe that expands as a(t) x vt. On the other hand, if the 
perturbations were generated at the late hot epoch, when they were subhorizon, 
there is no reason for the phase to be fixed: different waves can have different 
phases. One expects, instead, that the phase is a random function of momentum k. 
The latter property is indeed characteristic of concrete models of the generation of 
density perturbations at the hot epoch, which will be briefly discussed in Chapter 12. 

Once the phase in (1.23) is fixed (let us take for definiteness py, = 0, which is 
correct for most of perturbations, except for the those with the largest wavelengths), 
the waves at the recombination time t, have a definite phase which depends on the 
wave vector k: 


tr 
Lar A = Ax cos (| eo at), (1.24) 
PBy 0 a(t) 

As a result, the density contrast at recombination, ôpg4(k), is an oscillating function 
of k; for some momenta, the density (and hence temperature) is at maximum while 
for other momenta it is, roughly speaking, equal to zero. This shows up in the 
oscillatory behavior of the CMB temperature angular spectrum, see Fig. 1.6. The 


18Under an assumption that the perturbations are adiabatic; this assumption is confirmed by the 
data. 
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perturbations for which cosine in (1.24) equals +1 are the largest in amplitude. 
They are seen at definite angles, which correspond to the peaks in Fig. 1.6. Notably, 
the value of the initial phase y, in (1.23) inferred from observations agrees with the 
theoretical prediction. 

The oscillatory dependence of ĝppgy(tr; k) on momentum is reflected also in the 
property of the galaxy distribution in the late Universe, called baryon acoustic 
oscillations (BAO). This feature has been also observed experimentally. 

On the other hand, if the phase y, in (1.23) took random values for various 
momenta, the oscillatory behavior of the CMB angular spectrum and BAO would 
be absent, in contradiction to the observations. 

Thus, observational cosmology definitely requires that the phase y, in (1.23) is 
uniquely defined, and this means that the density perturbations existed at the very 
beginning of the hot epoch and hence were generated before that epoch. 


1.7.2. Drawbacks of the Hot Big Bang theory 


Besides the problem of primordial perturbations, the Hot Big Bang theory briefly 
discussed in Sec. 1.5 has other drawbacks. These are due to the fact that this theory 
needs very special — and unnatural — initial conditions, otherwise it would grossly 
fail to describe the early and present Universe. We discuss in detail the problem of 
initial conditions in the accompanying book, and here we give two examples showing 
what sort of problems we are talking about. 

The first of these examples has to do with the properties of the cosmological 
horizon in the Hot Big Bang theory. As we pointed out, the horizon size ly(t) 
at time t in this theory equals t modulo a numerical coefficient of order one. In 
particular, the horizon size at recombination is ly (tr) ~ 10° light yrs œ 300 kpc. 
Since then the size of this region has increased due to the expansion of the Universe, 
and today it is equal to ly (tr )ao/a(tr) ~ 300 Mpc. This is substantially smaller than 
the size of the present horizon ly(to) ~ 15 Gpc. Thus, the observable part of the 
Universe contains of order 50° ~ 10° regions which were causally disconnected at 
recombination epoch (within the Hot Big Bang theory). One would think that these 
regions should be quite different from each other. However, their properties both 
today and at recombination are exactly the same; in particular, the temperature of 
the CMB photons coming from different regions is the same to better than 0.01%. 
This inconsistency is called the horizon problem: the homogeneity of the Universe 
at large scales does not have natural explanation in the Hot Big Bang theory and 
can only be obtained within this theory by imposing very special initial conditions. 

Another example is somewhat different. As we already know, the Universe is 
warm, so it can be characterized by its entropy. Entropy density is of order of 
photon number density; in the present Universe 


s ~ 10? cm™?. 
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Thus, the estimate for the entropy in the observable part of the Universe, whose 
size is Ro ~ 104 Mpc ~ 107° cm, is 


S ~ sR ~ 10%, 


This huge dimensionless number is one of the properties of our Universe. Why does 
the Universe have such a large entropy? The Hot Big Bang theory does not answer 
this question, since entropy is (almost) conserved during the hot stage. The huge 
entropy has to be introduced into the Hot Big Bang theory “by hands”, as one of 
the initial data. This uncomfortable situation is called the entropy problem. 


1.8. Inflationary Theory 


Both the problem of initial conditions and the problem of primordial perturba- 
tions find elegant solutions in inflationary theory [40-44]. According to this theory, 
the hot cosmological epoch was preceded by the epoch of exponential expansion 
(inflation). During the inflationary epoch, the initially small region of the Universe 
(whose spatial size was comparable, say, to the Planck length lpı) inflated to a 
very large size, typically many order of magnitudes larger than the size of the part 
of the Universe we see today. This in the end explains flatness, homogeneity and 
isotropy of the observable part of the Universe. Due to the exponential character 
of expansion, the duration of the inflationary epoch may be short: the initial con- 
dition problems of the Hot Big Bang theory is solved provided that the duration is 
greater than (50—70) H zj where Hin s, is the Hubble parameter at inflation, say 
Hingi ~ 10-°Mp; (Hingi may be quite a bit smaller). In that case, the minimum 
duration of inflation is of order 10°tp,; ~ 107°°s. It is likely that inflation lasted 
much longer, but in any case it is plausible that we deal with microscopic time scale. 

The inflationary regime occurs if the energy density in the Universe depends on 
time very weakly. Energy density of conventional matter — gases of particles — does 
not have this property. Therefore, most models of inflation make use of hypothetical 
new field(s).19 Under certain conditions this new field — inflaton — is spatially 
homogeneous and changes slowly in time in inflating region. Its potential energy 
changes slowly too, and this gives rise to the inflationary expansion regime. 

At some moment of time the conditions for inflationary expansion get violated, 
and inflation terminates. A new epoch — post-inflationary reheating — sets in, at 
which the inflaton energy is tranferred to the energy of conventional matter. As a 
result, the Universe heats up to very high temperature, and the Hot Big Bang epoch 
begins. Reheating occurs with entropy generation, which provides the solution to 
the entropy problem. 


19The reservation here alludes to the possibility of employing the Englert—Brout—Higgs field of 
the Standard Model of particle physics [316]. Yet another possibility is to have extra terms in the 
action for gravitational field. This case is often equivalent to the introduction of new field(s). 


32 Cosmology: A Preview 


Inflationary theory was originally proposed as a solution to the initial data 
problems, but it soon became clear that it also solves the problem of primordial 
perturbations [45-49]. The original source of perturbations are vacuum fluctuations 
of quantum field(s), in the simplest version, vacuum fluctuations of the inflaton 
field itself. These fluctuations get strongly enhanced at inflationary stage due to 
strong time-dependence of the gravitational field in the Universe. At the end of 
inflation they are reprocessed into density perturbations of conventional matter. 
The amplitude of primordial perturbations depends on unknown parameters of a 
model, but their power spectrum (dependence on wavelength) is uniquely calculable 
within a given model of inflation. Notably, most models predict almost flat (almost 
Harrison—Zeldovich [50-52]) power spectrum, in gross agreement with observational 
data. However, a typical prediction is a slight tilt in the spectrum, which makes it 
different from the Harrison—Zeldovich one. Small red (negative) tilt of the power 
spectrum has been recently discovered by the analysis of the CMB data. 

Another prediction of inflationary models is the existence of primordial gravita- 
tional waves [40]. They are also generated at inflationary stage from vacuum fluc- 
tuations, now of the gravitational field. In some models the amplitudes of gravity 
waves of wavelengths comparable to the present Hubble size are of order 10~°—10~°. 
These gravity waves would affect CMB temperature anisotropy [53-56] and polariza- 
tion [57-62]. These effects have not been discovered yet, but they may be observable 
in future. Discovery of primordial gravity waves will not only be a strong argument 
in favor of inflation, but also will enable us to determine its fundamental parameter, 
the inflationary Hubble scale Hin f1. 

Presently, inflationary theory is well developed. We study various aspects of this 
theory in the accompanying book. 


1.8.1. Alternatives to inflation 


Cosmological inflation is by far the most popular and best understood scenario 
for the epoch that preceeded the Hot Big Bang epoch. Yet there are alternatives, 
which have not been ruled out. One alternative is the bouncing Universe which starts 
from contraction, experiences the bounce and then enters the expansion regime (a 
version is pulsating Universe). We briefly discuss the bouncing Universe scenario in 
the accmpanying book. Another option is the Genesis scenario [63] which assumes 
that the Universe was initially empty and had Minkowski geometry, then the energy 
density grew and the Universe expanded at a higher and higher rate until this regime 
terminated and the Universe entered the hot epoch. These scenarios require very 
exotic fields, more exotic than the inflaton. 

As opposed to inflation, its alternatives predict unobservably small amplitude of 
primordial gravity waves. There are other, more subtle predictions which discrim- 
inate these scenarios from inflation and which can be discovered observationally, 
at least in principle. It is quite plausible that with the development of theory and 
observations, we will soon or late learn which regime preceded the Hot Big Bang 
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epoch. Isn’t it fascinating that the observations of the Universe at large are capable 
of sheding light on the first moments of the cosmological history characterized by 
extremely high energy densities and expansion rates and extremely small distance 
and time scales? 

To conclude our preview we note that it misses some of the topics which we 
study in this book. We hope, nevertheless, that this preview makes the content of 
the book reasonably clear. 


Chapter 2 


Homogeneous Isotropic Universe 


In this book we use the basic notations and equations of General Relativity; see 
Appendix A. The notations and conventions are summarized in Sec. A.11. 


2.1. Homogeneous Isotropic Spaces 


To a very good approximation, our Universe is homogeneous and isotropic at 
sufficiently large scales. This means that at given moment of time, the geometry of 
space is the geometry of homogeneous and isotropic manifold. There are only three 
such manifolds! (up to overall scale): 3-dimensional sphere, 3-dimensional Euclidean 
space (3-plane) and 3-dimensional hyperboloid. The geometry of 3-dimensional 
sphere is best understood by imagining that it is embedded into (fictitious) 
4-dimensional Euclidean space and writing the equation of 3-sphere in the stan- 
dard form, 


(y')? + (y?)? + (y°)? + (y*)? = R, 


where y® (a = 1,...,4) are coordinates of the 4-dimensional Euclidean space and 
R is the radius of the 3-sphere. Let us introduce three angles x, 0 and ¢, so that 


yt = Reosx, 
y? = Rsin x cos, (2.1) 
y? = Rsin x sin 6 cos 4, 


yt = Rsiny sin f sin ¢. 


Then the distance between two points on the 3-sphere with coordinates (x, 0, ¢) 
and (x + dx, 0 + d0,¢+4+ dẹ) is 


dl? = (dy)? + (dy?)? + (dy)? + (dy*)? 


lit is important that the 3-dimensional metric has Euclidean signature, i.e., it can locally be cast 
into the standard form dl? = (dx)? + (da?)? + (dx?)?. 
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= R?{{d(cos y)]? + [d(sin x cos 0)]? + [d(sin x sin 8 cos ¢)]? (2.2) 
+ [d(sin x sin 6 sin ¢)]?}. 


After simple calculation we find that the metric of the 3-sphere has the following 
form, 


3-sphere: dl? = R?|dx? + sin? y(d6? + sin? 6d¢”)). (2.3) 


Note that no trace of the fictitious 4-dimensional Euclidean space is left in this 
formula, as should be the case. 

In analogy to 3-sphere, 3-dimensional hyperboloid is conveniently described by 
its embedding into fictitious 4-dimensional Minkowski space with metric 


ds? = —(dy’)” + (dy*)? + (dy*)* + (dy*)?, 
while the equation for the hyperboloid is 
(y*)? — (y?)? — (y?)? — (y*)? = BR. (2.4) 


We are interested in the one connected component y! > 0. 


Problem 2.1. Show that the hyperboloid is indeed a homogeneous and isotropic 
space. Hint: Begin with defining what precisely is a homogeneous and isotropic space. 


Coordinates on 3-hyperboloid can be introduced in analogy to (2.1): 
yt = Reoshy, 
y? = Rsinhy cos 8, 
y? = Rsinhy sin 6 cos 4, 
y* = Rsinhy sin @sin ¢. 


The calculation of the distance between two points on the hyperboloid is analogous 
to (2.2). It gives 


3-hyperboloid: dl? = R?[dx? + sinh? (d6? + sin? 6d¢)]. (2.5) 


For completeness, let us also write the metric of 3-plane (3-dimensional Euclidean 
space), 


3-plane: dl? = (da)? + (da)? + (dz? Y’. (2.6) 


One of the properties of homogeneous and isotropic spaces is that all covariant 
geometrical quantities are expressed through the metric tensor yi; and, possibly, 
tensors 6! and Eijk, the latter existing in any Riemannian space; see Appendix A 
(here i,j = 1,2,3; we denote the metric tensor of 3-dimensional space by y;,; to 
distinguish it from the metric tensor g,,, of 4-dimensional space-time). Furthermore, 
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the coefficients do not depend on coordinates. In particular, the Riemann tensor is 
equal to 


n 
(3) Rijki = Fe (Vik Vl = eran (2.7) 


where we introduced the parameter x = 0, +1, which distinguishes 3-plane, 3-sphere 
and 3-hyperboloid, 
+1, 3-sphere 
n= 0, 3-plane ; (2.8) 
—1, 3-hyperboloid 


It follows from (2.7) that the Ricci tensor is 
wn 


Rij = 2 R? 


Tij» (2.9) 
and the curvature scalar is constant in space and equal to 6xR~?. 
Problem 2.2. Obtain the relations (2.7) and (2.9) by direct calculation. 


Metrics of 3-sphere, 3-hyperboloid and 3-plane can be written in a unified form. To 
this end, let us first note that by introducing the coordinate p = Ry on the sphere 
and hyperboloid, and spherical coordinates (p,6,) on 3-plane, the metrics (2.3), 
(2.5) and (2.6) can be written as follows, 


dl? = dp? + r*(p)(d0” + sin? 6d¢”), (2.10) 
where 
R sin(p/R), 3-sphere 
r(p) = p, 3-plane : (2.11) 
R sinh(p/R), 3-hyperboloid 
The interpretation of quantities entering (2.10) is obvious: p is the geodesic 
(shortest) distance from the origin to a point with coordinates (p,6,), while 
r(p) determines the area of two-dimensional sphere at distance p from the origin, 


S = 4rr? (p). It is also clear that an interval of length / at distance p from the origin 
is seen from the origin at angle 


Instead of p, one can choose r as the radial coordinate. With this choice we obtain, 
e.g., for hyperboloid, 


38 Homogeneous Isotropic Universe 


so that the metrics of the three spaces take the following form, 


d 2 
dl? = ——__. + r?(d6? + sin? 6d¢2), (2.12) 
1- xi 

where the parameter x is the same as in (2.8). Note that the coordinates (r, 0, œ) 
cover only half of the 3-sphere: the region 0 < r < Ris part of the 3-sphere extending 
from the origin (pole) to the 2-dimensional surface of maximum area (equator of 
the 3-sphere). This is the reason for the coordinate singularity in the metric (2.12) 
at r= R for x = 1. 


2.2. Friedmann—Lemaitre—Robertson—Walker Metric 


Expanding homogeneous isotropic Universe is described by the metric 
ds? = dt? — a?(t)y;j;dx'dx!, (2.13) 


where 7;(z) is the metric of unit 3-sphere (metric (2.3) with R = 1), unit 
3-hyperboloid or 3-plane. Metric (2.13) is called the Friedmann—Lemaitre— 
Robertson—Walker metric (FLRW). One distinguishes closed Universe (the space 
is 3-sphere, x = +1), and open and flat Universe (the space is 3-hyperboloid and 
3-plane, x = —1 and x = 0, respectively). For closed and open Universe, the scale 
factor a(t) at given moment of time is the radius of spatial curvature. On the other 
hand, for spatially flat Universe, the scale factor itself does not have physical sig- 
nificance, since at a particular moment of time it may be set equal to any number 
(say, unity) by rescaling the spatial coordinates. What has the physical meaning in 
the flat Universe is the ratio of scale factors at different times, a(t,)/a(t2), and, in 
particular, the Hubble parameter 


Hereafter, the dot denotes derivative with respect to time. 

Note that in the cases of closed and open Universe the spatial coordinates zt 
entering (2.13) are dimensionless, while the scale factor has dimension of length. 
On the other hand, in the case of spatially flat Universe it is convenient to assign 
the dimension of length to the coordinates x‘, while treating the scale factor as a 
dimensionless quantity. 

The metric of homogeneous and isotropic Universe has the form (2.13) in a 
certain reference frame. This frame is singled out by the fact that space is the same 
everywhere at each moment of time. Furthermore, this frame is comoving: world 
lines of particles which are at rest with respect to this frame are geodesic, i.e., these 
particles are free. Before showing that, let us point out that for these particles one 
has ds? = dt?, i.e., the time coordinate t has the meaning of proper time of particles 
at rest. In the present Universe, these particles may be thought of as galaxies, if one 


2.2. Friedmann—Lemaitre—Robertson—Walker Metric 39 


disregards their local (peculiar) motion caused by gravitational potentials (which 
are due to, e.g., nearby galaxies). 

Let us show that the world line zê = const obeys the geodesic equation in 
metric (2.13), 


—— +T“ uu’ =0, (2.14) 


where u” is 4-velocity (see Appendix A). To this end, let us calculate the Christoffel 
symbols, 


1 
art = zI” (9X0 + On Ive = Oc Ju): (2.15) 


Nonzero components of the metric tensor are 


goo =1, gij = —a7(t) 74; (£), 


while for the inverse tensor we have 


It is clear that 
T9% =0, r9 =0, Tġo=0 
(in the expression in parenthesis in (2.15), at least two indices are equal to zero, 


but goi = 0, ôu goo = 0). For T$; we have 


i lik a ci 
ri = 59 og; = z% (2.16) 


The remaining Christoffel symbols are also straightforwardly calculated, 
T} = atij, (2.17) 
ann (2.18) 


where Or are the Christoffel symbols for metric yij. 
Let us now turn to Eq. (2.14). The only non-vanishing component of the 
4-velocity u” = dx" /ds of a particle at rest is 


pues 4G. 

ds dt 
Equation (2.14) is obviously satisfied, since du”/ds = 0 and To = 0 for any p. 
Thus, the world lines of particles which are at rest in our reference frame are indeed 
geodesic. 


40 Homogeneous Isotropic Universe 


Problem 2.3. If one chooses a time coordinate which does not coincide with proper 
time of particles at rest, FLRW metric takes the form 


ds? = N?(t)dt? — a?(t)yijdzr'da. 


Show by direct calculation that in this metric too, world lines of particles at rest 
are geodesic (this is of course obvious, since these lines are the same lines as in the 
text). 


To end this Section, we note that for both closed and open models, the spatial 
curvature can often be neglected, so that one can use the spatially flat metric 


This is certainly possible for processes at spatial scales much smaller than the 
curvature radius a(t). We already mentioned in Chapter 1 that our Universe is 
spatially flat to a very good approximation, both in the past and at present. Hence, 
the approximation (2.19) is actually very good for all scales. We will quantify this 
statement in what follows. 


2.3. Redshift. Hubble Law 


The scale factor a(t) grows in time, and the distances between points of fixed spatial 
coordinates x’ grow too — the Universe expands. Because of that, the wavelength 
of a photon emitted long ago by a distant source increases as the photon moves 
towards an observer, i.e., the photon wavelength gets redshifted. To descibe this 
phenomenon, let us write the action for free electromagnetic field in space-time 
with metric g,,(2): 


1 V 
where, as usual, 
Fiw =N gáv — VuAp = OpAv — Ay, 


and A,, is the electromagnetic vector-potential. The action (2.20) is the simplest 
covariant generalization of the action of Maxwell’s electrodynamics; the factor /—g 
ensures the invariance of 4-volume (see Appendix A). One could in principle add to 
the action (2.20) other invariant terms vanishing in the Minkowski limit, like 


I diry Zgg"” RY Faa Fpp (2.21) 


a 
2 
Mp, 


6S = 


where R* is the Ricci tensor, a is a dimensionless constant, and the factor M a 
is included on dimensional grounds. However, terms like (2.21) are negligibly small 
compared to (2.20) if the space-time curvature is small compared to the Planck 
value, |Ruv| < M};,, and the constant a is not extremely large. Therefore, at all 
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classical stages of the evolution of the Universe, when its parameters are far from 
Planckian, freely propagating photons are described by the action (2.20). 

Consider now the propagation of a photon in the homogeneous isotropic 
Universe. Realistically, the photon wavelength is small compared to the spatial 
curvature radius, even if the Universe is open or closed. Therefore, the Universe can 
be considered spatially flat, and one can use the metric 


ds? = dt? — a?(t)6,;da'da. (2.22) 


It is convenient to use conformal time 7 instead of cosmic time t. The former is 
defined by 


dt = adn, (2.23) 


In terms of this new time coordinate the metric is 


i.e., 


ds? = a° (n)[dn? — 6;;dx' dx’). (2.24) 
In other words, 


Juv = a?(n) Mu (2.25) 


where nay is the Minkowski metric. The metric (2.25) differs from the Minkowski 
metric by overall time-dependent rescaling, i.e., the metric of homogeneous isotropic 
Universe has conformally flat form in coordinates (7, 2’). In these coordinates one 
has 


g” =a nt’, yg = aÀ. 


By substituting these expressions into (2.20) we obtain that in coordinates (n, x*) 
the action of electromagnetic field coincides with 


1 
S= -{ fae nt” n™ Fpa Fup- (2.26) 


This property is inherent in a theory of massless vector field; in theories of other 
fields, the action in conformal coordinates (n, x) in general does not reduce to the 
flat space-time action. In this regard, free electromagnetic fields is called conformal, 
while other fields, generally speaking, are not conformal. 

It follows from (2.26) that solutions to equations for the free electromagnetic 
field in the Universe with metric (2.22) (or, equivalently, with metric (2.24)) are 
superpositions of plane waves 


a) _ (a) ,ikn—-ikx 
A9 = ela) gihn ites, 


where k is a constant vector (called coordinate momentum, or conformal momen- 
tum), k = |k|, and el” are the standard polarization vectors of a photon, a = 1,2. 
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We stress that k is not the physical momentum of a photon, and k is not the physical 
frequency, since dx and dy are not physical distance and physical time interval. The 
quantity Ax = 27/k is the coordinate wavelength of a photon, while the physical 
wavelength at time t according to (2.22) is 


A(t) = a(t)Aa = 2 (2.27) 


Similarly, Aņ = 27/k is the period of electromagnetic wave in conformal time, while 
according to (2.23), the period in physical time? t is 
t 
T =a(t)An= a (2.28) 
Hence, the physical momentum p and physical frequency of a photon at time t are 


(2.29) 


In expanding Universe, the scale factor a(t) increases in time, the physical wave- 
length (2.27) grows, while the physical momentum and frequency decrease: they get 
redshifted. If a photon was emitted at time t; with given wavelength A; (e.g., in 
the process of transition of hydrogen atom from excited to ground state), then its 
wavelength at the Earth is equal to 


As usual, the index 0 refers to the present value of the relevant quantity. 
The quantity 


2(t)=—2 -1 (2.31) 


is called redshift. The further is the object emitting photons, the longer these 
photons travel through the Universe to the Earth, the smaller is a(t;): objects at 
larger distances have larger redshifts. The redshift is a directly measurable quantity; 
its measurement boils down to the indentification of an emission (or absorption) 
lines and determination of their actual wavelengths; see Chapter 1. 

Let us stress that formulas (2.30) and (2.31) are general; they are valid at all z. 

For not so distant objects, the difference (to — t;) — the photon travel time — 
is not very large, and we can expand in (to — ti), 


a(ti) = ao — a(to)(to — ti). 


?Note that we assume here that the period T is much smaller than the time scale of the evolution 
of the scale factor. This is of course true at all cosmological epochs of interest. 
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In terms of the present value of the Hubble parameter Hp = H (to) one obtains 
a(t;) = ao{1 — Ho(to — ti)l. 
Hence, to the linear order in (to — ¢;) one has 
z(ti) = Ho(to — ti). 


Finally, the travel time is equal to the distance to the emitter, up to corrections of 
order (to = ti, 


r= to = ti 
In this way one obtains the Hubble law, 
z=Hor, z<1. (2.32) 


When deriving it, we assumed that (to — ti) is not large; this corresponds to small 
redshift, as we indicated in Eq. (2.32). 

The Hubble parameter Ho is one of the fundamental cosmological parameters of 
the present Universe. We have already discussed its importance in Chapter 1. Here 
we recall that the measured value is given by (1.5); for the definition of the related 
parameter h see (1.7), while the associated time and distance scales are presented 
in (1.9a) and (1.9b). 

To end this Section, let us make the following comment. Our derivation of 
Eqs. (2.27)—(2.29) and, accordingly, Eq. (2.30) was based on the fact that elec- 
tromagnetic field is conformal, i.e., in coordinates (7, x‘) its action reduces to the 
action in Minkowski space-time. However, these formulas are of general character; 
they are valid for all massless particles. 

Consider as an example a massless scalar field with action 


S= J d rygg" 0,00, 0. (2.33) 


In conformal coordinates (7, x’), the explicit form of this action in space-time with 
metric (2.24) is 


S= 5 / d adn a? (n) 0,00). (2.34) 


The scalar field with action (2.33) is not conformal: the action (2.34) does not 
reduce to the flat space-time action by any change of variables. By varying (2.34) 
with respect to ¢ we obtain the equation for the scalar field in metric (2.24), 


1 
On (a On) — 0;0;¢ = 0 (2.35) 


(summation over i = 1,2,3 is assumed; 0;0; is Laplacian in flat 3-dimensional 
space). In the first place, let us notice that the operator in the left-hand 
side of this equation — covariant d’Alembertian — does not depend on zf, 


44 Homogeneous Isotropic Universe 


so the solution can be written as a linear combination of 3-dimensional plane 
waves, 


The factor a~(7) is introduced for convenience; due to this factor, Eq. (2.35) 
becomes the equation for f(7) without first-order time derivative, 


82 
af- ftkf=o. (2.36) 


If the expansion rate of the Universe and its time derivative are small compared to 
the frequency, the second term in Eq. (2.36) can be neglected, and the solutions to 
the scalar field equation are superpositions of plane waves 


o= gtk miler, (2.37) 
a(n) 

Coordinate frequency and momentum, k and k, are again independent of time, 
which again leads to Eqs. (2.27)-(2.29), as promised. Note that the factor a~!(7) in 
the solution (2.37) compensates for the factor a?(7) in the Lagrangian in (2.33); in 
other words, the action for the field f(x, n) = a(n)¢(x, n) reduces to the flat space- 
time action of massless scalar field up to corrections containing time derivatives of 
the scale factor. 


Problem 2.4. Give quantitative formulation of conditions under which the second 
term in Eq. (2.36) is small compared to the third term. Express these conditions in 
terms of the physical frequency, Hubble parameter and its derivative with respect to 
physical time t. 


Problem 2.5. Consider photons and massless scalar particles (actions (2.20) and 
(2.33), respectively). Show that the relation (2.31) between redshift and scale factor 
remains valid in open and closed Universes, provided that the wavelength X(t) is 
small compared to the radius of spatial curvature a(t) at all times between emission 
and absorption. 


2.4. Slowing Down of Relative Motion 


Physical momenta of massive free particles also decrease as 


a 


where k is time-independent coordinate momentum. To see this, let us consider the 


(2.38) 


geodesic equation 


du! 
— + eau = 0, (2.39) 
s 
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for 4-velocity of a free particle, 


da 
foe 
j ds 


(see Appendix A). Let us note right away that uf are not the physical values of 
spatial components of 4-velocity: since the physical distances dX‘ are related to 
coordinate distances by dX‘ = a(t)dzx'’, the physical components are 


dX? 
ds 


The physical momenta are expressed through the physical velocities in the usual way, 


p= mv’, 
while the usual 3-velocities 
i_ dX ¢ 
= H 
are related to U* by 
i vi 
U’ = ao (2.40) 


Indeed, the 4-velocities u” obey the relation g,,u"u"” = 1. In metric (2.25) the latter 
relation is 


(u°)? _ a?utut =| 


(summation over i is assumed), or 
dt \? a) 


dX? dX’ ds U’ 


d ds d viy U? 
which gives precisely Eq. (2.40). 
Let us come back to the geodesic equation (2.39). For metric (2.22), the only non- 
vanishing components of connection are given by (2.16) and (2.17). Thus, spatial 
components of Eq. (2.39) (u = i = 1, 2,3) read 


Therefore, 


—_ 


+Tpjuoul +Tiguiu® =0, 
s 


or 
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In terms of the physical components, the latter equation is 


Hence, velocities of free particles decrease in time as 


yi= const 
alt)? 


i.e., the law (2.38) is indeed valid. 

Thus, the velocities of free particles with respect to comoving frame decrease in 
expanding Universe; particles gradually “freeze in”. In particular, if at early cosmo- 
logical stages massive particles were relativistic, they become non-relativistic at later 
stages. This behavior is characteristic of neutrino, if its mass is well above 1074 eV. 

To conclude this Section, let us give another derivation of Eq. (2.38). This deriva- 
tion has to do with solutions to massive field equations in expanding Universe. 
Consider as an example the theory of massive scalar field with action 


s= | dava (Fat? a,00.0- or). 


In metric (2.24) this action has the form 


1 
S= [een C = > g) . 
An equation obtained from this action by varying with respect to ¢ is the Klein- 


Gordon equation in expanding Universe (in conformal coordinates) 


=50) (00,8) — 00:6 + m7a7¢ = 0. (2.41) 


This equation again does not contain spatial coordinates explicitly, so its solutions 
are again superpositions of plane waves 


where k is independent of time. The physical wavelength of a particle is thus given 
by (2.27), so its physical momentum indeed redshifts according to (2.38). 
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Problem 2.6. Show that for slowly varying scale factor, solutions to Eq. (2.41) are 
superpositions of 


$ = if Q(n)dn „—ikx , 14+ 0(0 
a(n) (On) hen 


where Q(n) = yk? + m?a?(n). Thus, coordinate frequency (derivative of the expo- 
nent with respect to conformal time) equals Q(n), while the physical frequency is 


wo) = S = phen 


as should be the case in relativistic physics. 


2.5. Gases of Free Particles in Expanding Universe 


Let us consider the behavior of gases of stable non-interacting particles in a homo- 
geneous isotropic Universe. We will discuss local properties like number density, 
so we again neglect the spatial curvature and use metric (2.22). The gas of parti- 
cles is characterized by the distribution function f(X,p), so that f(X,p)d*Xd°p 
is the number of particles in physical volume element d*X in interval of physical 
momenta d*p. The distribution in general depends on time and is not one of the 
equilibrium distribution functions. We will consider homogeneous gases, whose dis- 
tribution functions are independent of coordinates and depend only on momenta 
(and time). 

We have seen in previous Sections that coordinate momenta of free particles k 
are independent of time. The coordinate volume d?x is also constant in time, so the 
distribution function written in terms of coordinate momentum is time-independent, 


f(k) = const. 


The number of particles in an element of comoving phase space is also independent 
of time, 


f(k)d?xd?k = const. 
The comoving phase space volume coincides with the physical one, 
k 
dna = Plax) (£) = aXe. 
a 


Hence, the time dependence of the distribution function of free particles is entirely 
determined by the redshift of momenta, 


f(p,t) = f(k) = fla(t)- pl. 
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If the distribution function is known at some moment of time and equal to f;(p), 
then at later times it is equal to 


f(p,t) = fi |= p) . (2.42) 


ay 


We stress that, generally speaking, this formula is valid for free particles only. 

We will see that the known particles, as well as (most likely) dark matter particles 
actively interacted with each other in the early Universe, so that they were in 
thermal equilibrium. At some moment of time (different moment for different types 
of particles) the Universe expanded to the extent that the density and tempera- 
ture became sufficiently small, and interactions between particles switched off. At 
that moment the particles had thermal distribution function, and after that the 
distribution function evolved as given by? (2.42). 

Let us consider in more detail two limiting cases. We begin with massless 
particles; the most interesting among them are photons after they decouple from 
other particles (i.e., after recombination). At the time t; when their interaction with 
matter switches off, they have Planckian distribution function. The latter depends 


only on the ratio of momentum and temperature at that time, lel, 
[p] 1 1 
(p) = i e . 2.43 
ho) = f (R!) = oS (2.43) 


For massless fermions this would be the Fermi—Dirac distribution at zero mass (see 
Chapter 5), which also depends only on the ratio lel, According to (2.42), the 
distribution function at later times is 


oa =n (ER) =n) 


Qi 

— T.i. 
a(t) * 
Hence, the distribution function always has the equilibrium shape despite the fact 
that photons are not in thermal equilibrium. It follows from (2.44) that relic photons 
have Planckian spectrum with effective temperature decreasing in time according to 

1 

a(t) 
We will see in what follows that the same behavior (with reservations) is character- 
istic also of the distribution function of photons in the early Universe, when photons 
are in thermal equilibrium with matter. 


where 


T= (2.44) 


Teg (t) x (2.45) 


3This statement is not exact for various reasons. One is that there are gravitational potentials 
in our Universe which are produced by galaxies, clusters of galaxies, etc. This fact is to good 
approximation irrelevant for relic photons, but it is very relevant for massive particles. The latter 
obeyed Eq. (2.42) at sufficiently early cosmological stages only, when the inhomogeneities in the 
Universe were small. 
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It goes without saying that the relation (2.45) is valid for other massless parti- 
cles, including fermions (if there are massless particles in Nature besides photons; 
gravitons are an entirely different story). Importantly, the effective temperature 
decreases according to (2.45) from the time at which these particles get out of 
thermal equilibrium with cosmic medium (time of decoupling). At later times their 
effective temperature may be different from the photon temperature. We encounter 
this situation in Chapter 7. If particles have small but non-zero mass m and at the 
time of decoupling were relativistic, their spectrum remains thermal, and effective 
temperature falls as a~'(t) as long as T. >> m. At later times the spectrum is 
no longer thermal. Indeed, the distribution in momenta is always Planckian (if the 
particles are bosons), i.e., the distribution function has the form (2.43) with tem- 
perature Te¢(t) x a~'(t), while at Tep < m the equilibrium distribution is the 
Maxwell—Boltzmann distribution. Relic particles of this sort are called “hot dark 
matter”; it is hot in the sense that the particles decouple being relativistic and 
hence they always have massless distribution function in momenta.* An important 
example of hot dark matter is the relic neutrino. 


Problem 2.7. Consider particles of mass m that decouple at T >> m. Show that 
their distribution in momenta is given by (2.43) at all times afterwards, including 
late times when Teg <m. 


Let us now consider another limiting case of particles which decouple being non- 
relativistic. At the time of decoupling they have the Maxwell—Boltzmann distribu- 
tion function (see Chapter 5) 


1 m — Ui 2 
f(p) = Ons exp(- = ) exn(- 2z) 


where T; and p; are the temperature and chemical potential at that time. According 
to (2.42), the distribution function at later times is equal to 


Fo) = gag em(-" a) (ST 


It can again be written in the Maxwell-Boltzmann form, 


M — [ef 2 
f(p,t) = TFP (=) ep(- z) (2.46) 


where 


4The same property holds for “warm” dark matter; the distinction between hot and warm dark 
matter is discussed in Sec. 9.1. 
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and effective chemical potentail jeg is determined by the relation 


mM — Hep (t) _ m— bi 


Te j 7 Ti 
Equation (2.46) means that the distribution function still has the equilibrium formë 
(although particles are no longer in thermal equilibrium), and the effective temper- 
ature changes as 


1 
Teg (t) x =. 2.47 
eff ( ) x a2 (t) ( ) 
It decreases faster than in the case of massless particles. Dark matter particles which 
were non-relativistic at their decoupling are called cold dark matter. As we pointed 
out in Chapter 1, cold (or possibly warm) dark matter makes about 25% of total 
energy density today. 


5See footnote 3 in this Chapter. 


Chapter 3 


Dynamics of Cosmological Expansion 


3.1. Friedmann Equation 


The law of the cosmological expansion, i.e., the dependence of the scale factor a on 
time, is determined by the Einstein equations (see Appendix A), 


1 
Ruy — 59uvR = 81GT yy. 


Let us find their explicit form for homogeneous and isotropic metric (2.13). We 
begin with the calculation of the Ricci tensor, 


Ruw = an, B On + B ig _ Tào Ao (3.1) 


The non-vanishing components of the Christoffel symbols are given by (2.16)-(2.18). 
These formulas imply, in particular, that 


p Üi oð pu _(8)pi 
Phy = 70i 535 PhO. 


Let us first calculate Roo. Since Tho = 0, the only contributions come from the 
second and fourth terms in (3.1), and we obtain 


Roo = -aTi = TTia = — 3TA = riri 
a er å a\* 
--a()- (G49 (8) 6) 
a a a a 


Fie (3.2) 
a 


Finally, 


Let us now turn to the mixed components Roi. Keeping only non-vanishing Christof- 
fel symbols in (3.1), we write 


j à j pÀ k pj 
Roi = Oh a TrA + Ml jr = TiTi (3.3) 
This expression in fact is equal to zero, since D. are independent of spatial coordi- 


nates, Tà =r j are calculated with static metric yi; and hence are independent 
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of time and Ti; og ô$, which leads to the cancellation of the last two terms in (3.3). 
Hence, 


ie =0- (3.4) 


This should have been expected, since Ro; transform as components of a 3-vector 
under spatial rotations, and there is no special direction in isotropic space. 

Let us finally calculate the spatial components Rij. We again keep non-vanishing 
Christoffel symbols in (3.1) and write 


Rij = (Ool?; + ATE) — OT}, 
+ YLE +5 P25) — CRP oo + POG, + PGP), (3.5) 


where we collected in parentheses the terms that come from each of the four terms 
in (3.1). Taking into account (2.18), we combine into the Ricci tensor ©) R;; those 
terms in (3.5) which contain spatial derivatives only; the tensor (°) R;; is calculated 
with the 3-dimensional metric 7;;. Four other terms are calculated directly, and we 
obtain 


: : a, Gp Wap, 
Rij = 0o(Ga) Vi + aayiz ` 3— — aayik ` <5; = — 6; ayr + (3) Rij. 


Finally, 
Rij = (Ga + 24? + 2x) yi;, (3.6) 
where we made use of relation (2.9). 
Let us now use Eqs. (3.2), (3.4) and (3.6), to find the scalar curvature, 
7 1. 
R= gh’ Ru = g% Roo + g” Rij = Roo — qa) Baj: 


Since y" yi; = 3, we have 


.. . 2 
a à un 
R=-6(-+ 3+ 7). 
(: ret z) 
As a result, the 00-component of the left-hand side of the Einstein equations takes 
the simple form, 


1 à? x 
Roo — zok =3 (5 + =) 4 (3.7) 


The other components of the Einstein tensor G, = Ruy — 49u R are calculated in 
a similar way. 

Let us now turn to the right-hand side of the Einstein equations. At cosmolog- 
ical epochs of interest to us here, matter in the Universe can be described macro- 
scopically: it can be considered as homogeneous fluid with energy density p(t) and 
pressure p(t). This fluid as a whole is at rest with respect to comoving reference 
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frame, so the only non-zero component of its 4-velocity is u°. From the relation 
guvu”u” = 1 we have 


w=1, uw=1. 
Hence, the 00-component of energy-momentum tensor is (see Appendix A) 
Too = (p + p)uouo — gooP = P- (3.8) 


Combining (3.7) and (3.8) we conclude that the 00-component of the Einstein 
equations has the following form for homogeneous and isotropic Universe, 


à\? 8r nx 


This is called Friedmann equation; it relates the rate of the cosmological expansion 
(the Hubble parameter H = å/a) to the total energy density p and spatial 
curvature. 

The Friedmann equation has to be supplemented with one more equation, since 
Eq. (3.9) contains two unknown functions of time, a(t) and p(t). To obtain the 
additional equation, it is convenient to consider the covariant conservation of the 
energy-momentum tensor (see Appendix A) 


VaT” =0., 
Setting v = 0, we write 
VaT? 20,7 +E, T” +r T" = 0. (3.10) 
Non-vanishing components of the energy-momentum tensor in the comoving 
frame are 
T = gg Too = p, (3.11) 
TY = g" g” Ta = g" g" (oup) = VP. (3.12) 


Here we made use of the fact that the spatial components of 4-velocity are zero 
in the comoving frame, so that T;; = (p + p)uiuj; — pgij = —pgij: Note that y” 
entering (3.12) are time-independent components of the matrix inverse to yij. We 
again use the expressions for non-zero Christoffel symbols, Eqs. (2.16)-(2.18), and 
write Eq. (3.10) in the following explicit form, 


p+3-(p+p) =0. (3.13) 


To close the system of equations determining the dynamics of homogeneous and 
isotropic Universe, one has to specify the equation of state of matter, 


p = p(p). (3.14) 
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The latter equation is not a consequence of equations of General Relativity. 
The equation of state is determined by matter content in the Universe. For 
non-relativistic particles one has p = 0, while p = ip for relativistic particles and 
p = —p for vacuum; see Sec. 3.2. 

Equations (3.9), (3.13) and (3.14) completely determine dynamics of the cos- 
mological expansion. Let us make two remarks concerning Eqs. (3.13) and (3.14). 
First, if there exist several types of matter in the Universe which do not interact 
with each other, then the energy-momentum tensor of each type of matter obeys 
the covariant conservation equation independently. Therefore, Eqs. (3.13) and (3.14) 
must be satisfied by every type of matter separately. On the other hand, the energy 
density p in the Friedmann equation (3.9) is the sum of energy densities of all types 
of matter. Second, if matter in the Universe is in thermal equilibrium at zero chem- 
ical potentials, then Eq. (3.13) has simple interpretation. It can be written in the 
following form, 

SO gafa): (3.15) 
pt+p 
The left-hand side of this equation coincides with d(Ins) where s is the entropy 
density, see Chapter 5. Hence, Eq. (3.13) reduces to the relation 


sa? = const, 


which means the entropy conservation in comoving volume. In other words, due to 
the expansion of the Universe, entropy density decreases as the inverse element of 
the comoving volume, s x a~*. We see in Sec. 5.2 that the same interpretation 
holds for systems at non-vanishing chemical potentials too. 

To conclude this Section, let us make two comments. First, we note that when 
obtaining the equations governing the dynamics of the cosmological evolution we 
made use of only one Einstein equation, Rog — tgo R = 87GT oo, and only one 
of the covariant conservation equations, V,,T“° = 0. One can show that other 
Einstein equations and covariant conservation equations are identically satisfied for 
solutions to Eqs. (3.9) and (3.13). Nevertheless, let us write for future reference 
the equation that follows from the ij-components of the Einstein equations. It is 
clear from (3.6) that 7j-components of the Einstein tensor are proportional to Yiz. 
Furthermore, the ij7-components of the energy-momentum tensor Tij = —p gij are 
also proportional to y;;. So, all 7j7-components of the Einstein equations are reduced 
to a single Raychaudhuri equation, 

a à? x 
27 + 27 —8rGp — Pe (3.16) 
We do not use this equation in this book; it can be viewed as a consequence of the 
Friedmann equation (3.9) and covariant conservation of energy. 

The second comment is that the assumption that matter filling the Universe is 

ideal fluid is in fact unnecessary for deriving Eqs. (3.9), (3.13) as well as Eq. (3.16). 
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The cosmological model we discuss is self-consistent if matter is homogeneous and 
isotropic. The isotropy means that the energy-momentum tensor necessarily has 
the diagonal structure, Too = p, Tij = —p gij, where the parameters p and p are 
energy density and effective pressure. Homogeneity requires that these parameters 
are functions of time only, p = p(t), p = p(t). With this understanding, the analysis 
of this Section applies to the general case. 


Problem 3.1. Prove that equations 
1 1 
Roi — s90iR = 8rGTo:, Rig — ZIR = 8rGTij, VuT™ =0 


are identically satisfied in the case of homogeneous and isotropic Universe, provided 
that the following equations are satisfied, 


1 
Roo = x gookt = 87GToo, Vai = 0. 


Do not assume in the proof that the Universe is filled with ideal fluid; it is important 
only that matter is homogeneous and isotropic. 


Problem 3.2. Find equation of state in the model of a scalar field with the 
Lagrangian 


L=-VY/1—0,60"¢, Vo>O, 


assuming that the field is classical and spatially homogeneous (i.e., it depends on 
time only). 


3.2. Sample Cosmological Solutions. Age of the Universe. 
Cosmological Horizon 


Before discussing the realistic model of our Universe, let us present several examples 
of cosmological solutions. In this Section, we mostly consider a spatially flat model, 


x=Q. 


This is a very good approximation to the real Universe; we will see in what follows 
that the term x/a? in the Friedmann equation (3.9), if any, is small compared to 
the first term in the right-hand side at both present and early epochs. 

In the spatially flat model, the Friedmann equation takes the following form, 


(2) = Kep. (3.17) 


Simple solutions described in this Section are obtained for the Universe filled with 
one type of matter. Then Eqs. (3.13) and (3.14) (or, equivalently, (3.14) and (3.15)) 
determine the energy density as a function of the scale factor, p = p(a), and then 
the dependence of the scale factor on time is found from Eq. (3.17). Let us recall 
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(see Sec. 2.2) that in the case of a spatially flat Universe, the physically meaning- 
ful quantity is the ratio of scale factors taken at different times, rather than the 
scale factor itself. Therefore, one expects that a solution a(t) will contain an arbi- 
trary multiplicative constant. Also, Eqs. (3.13) and (3.17) are invariant under time 
translation, so a solution will contain another arbitrary constant, “the beginning 
of time”. 


3.2.1. Non-relativistic matter (“dust”) 


We begin with the model of the Universe filled with non-relativistic matter whose 
equation of state is 


p=0. 
We obtain from Eq. (3.15) 


a const. (3.18) 


a? 


Everywhere in this Section, “const” denotes an arbitrary positive constant (which is 
generally different in different formulas). Relation (3.18) has a simple interpretation: 
number density of particles n decreases as the comoving volume increases, and 
a°n = const, meaning the conservation of the total number of particles. Since the 
energy density is given by p = mn, where m is mass of a particle, the energy density 
behaves in the same way, i.e., pa? = const. 


With account of (3.18), Eq. (3.17) takes the following form, 
a\? _ const 
aj — aœ 


a(t) = const - (t — t,)?/3, (3.19) 


and has the solution 


where t, is an arbitrary constant. The Universe expands, and its expansion decel- 
erates, & < 0. The energy density is 


al) = = (3.20) 


t= t)? 
The solution (3.19), (3.20) is singular at t = t.: at that moment the scale factor is 
equal to zero (all distances are vanishingly small), and the energy density is infinite. 
This is an example of the cosmological singularity, the moment of Big Bang. We 
will see that many other cosmological solutions start from the singularity. Clearly, 
it does not make sense to extrapolate the classical evolution back to the Big Bang 
singularity. What we learn is that it is quite possible that the Universe started off its 
evolution from a state in which the energy density was very high (say, comparable 
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to the Planck energy density pp, ~ M$, ~ 107° GeV*), and for which classical laws 
of physics were not applicable. 

In what follows, we will count time from the cosmological singularity (if a solu- 
tion has this singularity). For the solution (3.19) and (3.20), this means that 


t, =0. 


Then t is the age of the Universe. Making use of (3.19), we relate it to the Hubble 
parameter, 


ae (3.21) 


Making use of Eq. (3.17) once again, we find 
3 1 1 

=. Ff? =— 

87G 6TG t? 
Equalities (3.21) and (3.22) relate the physical quantities; they do not contain arbi- 
trary parameters. 

If most of the evolution of our Universe occurred in the regime of domination of 
non-relativistic matter, the age of the Universe would be given by (3.21), i.e., 


p (3.22) 


2 
to = =. 2 
° 3H eee 
Making use of (1.8), we would obtain 
to = h-* -0.65 ~10" yrs = 0.93 - 10°? yrs (h = 0.7). (3.24) 


This value, even with an account of the uncertainty in the determination of Ho 
(see (1.5)), would contradict independent bounds on the age of the Universe, to > 
1.3 - 10!° yrs, which we mentioned in Chapter 1. We will see that the situation 
becomes comfortable if the expansion of the Universe today is determined mostly 
by dark energy. 

Let us make use of the solution (3.19) to introduce another important notion, 
the cosmological horizon. Imagine signals emitted at the moment of Big Bang and 
since then are traveling at the speed of light. We are interested in the distance ly (t) 
that such a signal travels from the point of its emission by the time t. The physical 
meaning of ly (t) is that it is equal to the size of the region (in infinite Universe!) 
causally connected by the time t: an observer living at the time t cannot know in 
principle what has happened outside the sphere of radius ly (t). This sphere is called 
the cosmological horizon, and ly (t) is the size of the observable part of the Universe 
at the time t. Clearly ly increases in time; the horizon opens up. 

Let us point out that another term for the cosmological horizon introduced here 
is “particle horizon”. This is to be distinguished from “event horizon” which is 
discussed in Sec. 3.2.3. 
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For calculating I(t), it is convenient to make use of conformal time 7, see 
Sec. 2.3. Light-like geodesics obeying ds? = 0 in metric (2.24) are described by the 
equation 


\dx| = dn. 


Therefore, the coordinate size of the horizon at time t is equal to ņn(t), and its 
physical size is 


l(t) = enO = at) | =. (3.25) 
For solution (3.19) we have 
a aa (3.26) 


If our Universe were matter dominated, the size of the horizon today would be 
equal to 


lio = EA 
o= 
We find numerically from (1.8) that 
lito = h~! - 6000 Mpc 
= 0.86 -10* Mpc = 2.6-107 cm (h = 0.7). (3.27) 


Yet another property of the horizon is the subject of the following problem. 


Problem 3.3. Show that signals emitted by sources which at time t are away from 
an observer at the distance ly(t), come to the observer at time t with infinite 
redshift. 


To conclude, in models with the cosmological horizon, the region in the Universe 
which is observable in principle, has a finite size even if the Universe itself is infinite. 


3.2.2. Relativistic matter (“radiation”) 


If the energy density in the Universe is due to relativistic matter, the equation of 
state is (see Chapter 5) 


In this case Eq. (3.15) gives 


const 
P= _ 


(3.28) 


a 


This behavior differs from (3.18) due to the fact that the expansion of the Universe 
leads not only to the dilution of the number density of particles, n x a~, but 
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also to the redshift of the energy of each particle, w x a~! (the latter properties 
are valid for both non-interacting relativistic particles, cf. (2.29) and particles in 
thermal equilibrium; see Sec. 5.1). 

Equation (3.17) becomes 


and has the solution 
a(t) = const - t/? 


(we again set the parameter ts equal to zero). The properties of this solution are 
analogous to those of the solution (3.19) and (3.20): the Universe undergoes decel- 
erated expansion; the moment t = 0 is the moment of the Big Bang singularity; the 
age is inversely proportional to the Hubble parameter (cf. (3.21)) 


and the energy density is inversely proportional to the age squared (cf. (3.22)), 


3 3 1 
s Ha, 
8rG 321G t? 


The horizon size is again finite, 


— a(t) f i Sipa m (3.29) 


It is useful to relate the expansion rate (the Hubble parameter) to temperature, 
assuming thermal equilibrium between all types of particles and neglecting chemical 
potentials. At temperature T the energy density is (see Chapter 5) 


T2 


pgi (3.30) 


where 
g= 90+ Eo 
b F 


is the effective number of degrees of freedom. Here summation runs over bosonic 
(b) and fermionic (f) particle species of masses smaller than T, gẹ and gy are the 
numbers of spin degrees of freedom of the boson b and fermion f. Using G = Mp? 
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(see Sec. 1.1) we write the relation (3.17) in the following form, 


T2 
— Mgr 


90 1 
M+, =,/——Mpri = ———M 3.32 
pi Srg | 166/q, °. ee 


is the reduced Planck mass. We will repeatedly use the relation (3.31) keeping 
in mind that the parameter Mý, depends on the effective number of degrees of 
freedom g, and hence on temperature (since a particle of mass m contributes to gs 
only at T = m). This dependence, however, is rather weak, and when discussing 
the early Universe at certain stages of its evolution one can often treat Mp, as a 


H (3.31) 


where 


constant. 

By comparing (3.28) with (3.30), we observe that the temperature of relativistic 
matter in thermal equilibrium is inversely proportional to the scale factor (up to 
slightly varying factor depending on gx), 
const 

a(t) 
Let us recall that the same relation (which in that case is exact) holds for the 


effective temperature of a gas of non-interacting relativistic particles, see Sec. 2.5. 
Finally, it is useful to note that Eqs. (3.31) and (3.33) imply 


T(t) © (3.33) 


T T? 
—72-H=— . 
T Mp, 


(3.34) 


The latter two relations, (3.33) and (3.34), are exact at those periods of the cosmo- 
logical evolution in which the effective number of degrees of freedom gą does not 
change. 

3.2.3. Vacuum 


In flat space-time, vacuum is the same in all inertial reference frames. It may have 
non-zero energy density, and the Lorentz-invariance dictates the form of its energy— 
momentum tensor, 


Tv = piadar: (3.35) 


The vacuum energy density is equal to Too = Puac, while effective pressure, defined 
by Ti; = —pmij, is 


P= -—Pvac: 


Thus, vacuum has an exotic equation of state p = —p; for positive energy density, 
the vacuum pressure is negative. 
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In the curved space-time of not very high curvature, the expression (3.35) 
remains valid in any locally-Lorentz frame, and in an arbitrary frame one has 


Thy = PvacJpv- (3.36) 


Here Pvac is a constant in space and time; in principle it should be calculable in 
a complete theory of elementary particles and their interactions. Up to now, no 
compelling calculation of the vacuum energy density has been made, and this is one 
of the major problems of fundamental physics. 

Time-independent Pyac is consistent with Eq. (3.13), which for p = —p gives 
p=0. This is in fact obvious: Eq. (3.13) is a consequence of the covariant 
conservation of the energy-momentum tensor, while the tensor (3.36) obeys 
covariant conservation law for pyac = const due to the fact that Vag? = 0 (see 
Appendix A). 

One can look at the right-hand side of the Einstein equations with Ty = const - 
Juv from a slightly different prospective. General covariance allows us to add to the 
Einstein—Hilbert action Sg of General Relativity, the term 


Sa = -A f vV=a'z. 


By varying the action (Sg + S4) with respect to metric one obtains, in the absence 
of matter, the following equations (see Appendix A), 


1 
Rup _ 59k = 8TGAGur = 0. 


They are exactly the same as the Einstein equations with energy-momentum ten- 
sor (3.36), with identification A = pyac. Historically this route was the first, and 
the parameter A is often called the cosmological constant for historical reasons. The 
difference between the cosmological constant and vacuum energy density is purely 
philological, at least at the current level of understanding of this issue. 

The solution to the Friedmann equation (3.17) with p = const = Pyac is 


a = const - east, (3.37) 


| 8T 
Aas = 3 GPvac 


is independent of time. The space-time with metric 


where the Hubble parameter 


ds? = dt? — e*Hastq,? (3.38) 


is called de Sitter space.! It is the space-time of constant curvature. 


'The coordinates (t,x) cover only half of the de Sitter space, see [36] and the problem in this 
Section. 
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Problem 3.4. Show that de Sitter metric obeys (cf. (2.7)) 


Ruvrp = =H; (9ur9vp — IupJvr): 


Unlike in previous examples of cosmological solutions, the Universe undergoes 
accelerated expansion, @ > 0. Furthermore, de Sitter space does not have the initial 
singularity: even though a(t) tends to zero as t => —oo, the metric can be made 
non-singular at any t by a coordinate transformation. 


Problem 3.5. Consider a fictitious flat five-dimensional space with metric 


ds? = (dy)? — (dy")? — (dy?)? — (dy®)? — (dy*)’. 


Consider a hyperboloid embedded into this space according to equation 


(y°)? — (yt? — (y?)? — (y?)? — (y*)? = —H~? = const. 


It is clear that this hyperboloid does not have singularities. Let us choose the coor- 
dinates (t, x’), i = 1,2,3 on this hyperboloid, such that 


A 
y? = -H~ tsinh Ht — —x?e#t, 
y =r, (3.39) 


H 
yt = H~'coshHt — geen 


Show that with this choice of coordinates, the metric induced on the hyperboloid 
from the five-dimensional space coincides with (3.38). What part of the hyperboloid 
is covered by the coordinates (t,x)? 


In the case of de Sitter space, the cosmological (particle) horizon is absent. The 
“beginning of time” is shifted to t = —oo. So, instead of Eq. (3.25) we have for the 
horizon size, 


t dt t i 
lat = a(t) f )= etast f dt'e Hast’ = gp, 


-œ a(t’) 


which precisely means that the particle horizon is absent. 

For spaces like de Sitter, another notion of horizon is introduced. It is quite 
different from the particle horizon discussed in Sec. 3.2.1. Namely, let there be an 
observer at the point x = 0 at the moment of time t. Let us ask what is the size of 
the region from which signals emitted at that moment of time will never reach the 
observer (which will stay at the point x = 0) in arbitrary distant future. Since the 
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time-like geodesics obey |dx| = dn, the coordinate size of this region is 


ult 00) =n = [> 


t 
and its physical size at time t equals 
DE 1 
las = a(t — = — . 3.40 
ws=a) f To = a (3.40) 


The observer will never know about events that happen at a given moment of time 
at distances exceeding lag = H ey this is the meaning of the de Sitter horizon. It is 
also called event horizon. 


Problem 3.6. Show that the solutions of Secs. 3.2.1 and 3.2.2 do not have event 
horizons. 


3.2.4. General barotropic equation of state p = wp 


Let us continue with the brief discussion of a model with the matter equation of 
state 


p = wp, 


where w is a constant larger than —1. Non-relativistic and relativistic matter 
correspond to w = 0 and w = 1/3, respectively. Models with negative (and, gener- 
ally, time-dependent) w attract considerable attention in recent years; matter with 
this effective equation of state has different names: quintessence, time-dependent 
A-term, etc. 

At w > —1 the solution to Eq. (3.13) is 


const 


We find from Eq. (3.17) that 
a = const: t”, 


where 


The parameter œ is positive; there is the cosmological singularity at t = 0. The 
energy density behaves as 


const | 
p= a} 


it tends to infinity as t —> 0. Since 


ä = const - ala — 1)t®~?, 
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the expansion of the Universe decelerates (å < 0) for œ < 1 and accelerates at 
a > 1. In terms of the equation-of-state parameter w we have 


1 
(a) w>-z: deceleration, 


1 
(b) a acceleration. 


Note that in the open Universe dominated by negative spatial curvature, the effec- 
tive energy density is equal to p = const/a? (see Eq. (3.9) with x = —1). This 
would mean that w = —1/3 (see (3.41)). In that case the expansion would neither 
accelerate nor decelerate, a = 0. 

The cases (a) and (b) differ also in the following respect. The models from the 
class (a) have cosmological (particle) horizon and do not have event horizon, while 
the situation is reverse for models from the class (b). Indeed, the particle horizon 


exists if the integral 
is dt! 
o a(t’) 


converges (see (3.25)). For a < 1 (i.e. w > —4) this integral converges indeed, while 
for a > 1 (i.e. w < —3) the integral diverges at the lower limit of integration. In the 
latter case the particle horizon gets shifted to spatial infinity. The existence of event 
horizon is determined by the convergence properties of the integral (see (3.40)) 


© dt! 
| a(t!) 
It diverges at the upper limit of integration for œ < 1 (event horizon is absent) and 
converges for a > 1 (event horizon exists). 


Problem 3.7. Study the cosmological evolution in a model with equation of state 
p= wp, where w is time-independent and w < —1. 


Problem 3.8. Is it possible in expanding Universe with equation of state p = p(p) 
to evolve from the regime (p+ p) > 0 to the regime (p+ p) < 0 without violating the 
real-valuedness of the sound speed us defined as u? = Op/Op? 


3.3. Solutions with Recollapse 


For completeness, we briefly discuss in this Section the homogeneous and isotropic 
solutions, in which expansion of the Universe is followed by contraction (recollapse). 
This situation occurs if the right-hand side of the Friedmann equation (3.9) contains 
both positive and negative terms, and if positive terms decrease with growing scale 
factor faster than the absolute values of negative terms. Examples of possibly inter- 
esting negative contributions are the contribution of curvature in the closed model 
(x = +1) and the dark energy contribution. Regarding the latter, we know that it 
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is positive at the present epoch, but we cannot exclude that it depends on time and 
will become negative in distant future. 

As an example, let us consider the closed cosmological model with non- 
relativistic matter. Making use of (3.18) we write the Friedmann equation 


a\* a 1 

m 
(2) ge (3.42) 
where the parameter am is determined by the total mass of matter in the Universe. 


At a < am the Universe expands in the same way as in the spatially flat case 
(Sec. 3.2.1). The expansion terminates when 


a= am; 
at that time the right-hand side of (3.42) vanishes. 


Problem 3.9. Find the relation between am and the total mass in the closed 
Universe. What would be the maximum size of the Universe having 1 kg of non- 
relativistic matter? 


The explicit solution has a simple form in terms of the conformal time 7 defined 
by dt = adn (see (2.23)). The Friedmann Eq. (3.42) then takes the form 


and its solution is 
a = am sin? 1, (3.43) 


The expansion begins from singularity at 7 = 0, the Universe has maximum size at 
n =n, and at 7 = 2r it collapses back to the singularity. The relation between the 
physical and conformal time is 


am 
t= J aan = 50 — sin n). (3.44) 
Thus, the total lifetime and the maximum size are related by ttot = T ` am- 


Problem 3.10. Show that at a & am the solution (3.43), (3.44) indeed coincides 
with the spatially flat solution (3.19). 


Similar situation occurs in the case when the expansion of the Universe ter- 
minates due to negative A-term. The Universe exists for finite time between its 
emergence from and recollapse to the singularity. 


Problem 3.11. Find the law of evolution a = a(t) of the spatially flat Universe 
(x = 0) with negative, time-independent cosmological constant, assuming that 
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matter in the Universe has equation of state p = 0. Find the total lifetime. Hint: 
Make use of the Friedmann equation in physical time. 


Problem 3.12. Consider the Universe filled with matter whose equation of state is 
that of Chaplygin gas [37], 


A 
p=-—. (3.45) 
(1) Find the dependence of the Hubble parameter on the scale factor. 


(2) Find the law of evolution a = a(t) at small and large scale factors in all three 
cases, x = 0, +1. 


(3) Find the complete evolution a = a(t) in the case of spatially flat Universe. 

(4) What values of x admit static solutions to the Einstein equations? 

(5) What can be said about the future of the Universe, if it is known that at some 
moment of time the expansion of the Universe accelerates? Consider all three 
cases, x = 0, +1. 

(6) Consider a theory of a scalar field with action 


1 
Se= | dev= [žo - vo l 


In the case of spatially flat Universe, find the scalar potential V(@) for which the 
spatially homogeneous solution describes the same evolution as in the case (3), and 
the relation between the energy density and pressure has the form (3.45). 


Chapter 4 


ACDM: Cosmological Model 
with Dark Matter and Dark Energy 


4.1. Composition of the Present Universe 


Cosmological solutions studied in Sec. 3.2 are not realistic. Energy density in the 
present Universe is due to non-relativistic matter (baryons and dark matter, as well 
as those neutrino species whose mass is considerably larger than 5K ~10~° eV), 
relativistic matter (photons and light neutrino of mass less than 1074 eV, if such 
a neutrino exists, see Appendix C) and dark energy. In general, the Universe may 
have non-vanishing spatial curvature. Therefore, all these contributions have to be 
included into the right-hand side of the Friedmann equation (3.9), and this equation 
takes the form 


n 2 
a 81 
H’ = (ż) = z Clu + Prad + Pa + Peure)s (4.1) 


where Pm, Prad, Pa are energy densities of non-relativistic matter, relativistic matter 
(“radiation”) and dark energy, respectively, and by definition 


aC Rae = og (4.2) 


is the contribution due to spatial curvature. Let us introduce the critical density 
Pe by 

Dez D (4.3) 
We stress that we always use the notion of critical density only in application to the 
present Universe; for us pe is the time-independent quantity. Its meaning is that if 
the total energy density in the present Universe, Pm,o + pPrad,0 + Pao, equals precisely 
Pc, then the Universe is spatially flat (since in this case peurs = 0 and x = 0). The 
numerical value of the critical energy density is given in (1.14). We note that the 
average energy density in the present Universe is fairly small: it is equivalent to 5 
proton masses per cubic meter. 
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Let us introduce the parameters, 


7 ka 


Qu = Pm,o Qays Prad,O Qa = Pa,o One Pcurv,0 (4.4) 
Pe Pe Pe Pe 


Again, these parameters refer to the present Universe only, and by definition they 
do not depend on time. It follows from (4.1) and (4.3) that 


Xo; = Qu t Oraa t Qa t Qeurv =i (4.5) 


The parameters 2; are equal to relative contributions of different sorts of energy, and 
also of spatial curvature, into the right-hand side of the Friedmann equation (4.1) 
at the present epoch. Qs make one of the most important sets of cosmological 
parameters. 

It is rather straightforward to estimate the relative contribution of relativistic 
particles Qraa. It comes mainly from relic photons of temperature To = 2.726 K. 
According to the Stefan—Boltzmann law, their energy density is (see also 
Chapter 5) 


2 
TT 
Py,0 = 2—Ty, 


30 
where the factor 2 is due to two photon polarizations. Numerically, 
-10 GeV 
pag = 2.6 - 107}? ase 
and 
Q S95 abo: h= 0.7. (4.6) 


If there exist massless or light neutrinos (with m, <5K~10~% eV), their contri- 


~N 


bution is somewhat smaller than Q4. Hence 
nas 10, (4.7) 


Because of this, the effect of relativistic particles on cosmological expansion is neg- 
ligible today and in the late Universe. 

We mentioned in Chapter 1 that observations of CMB anisotropy imply that 
the spatial curvature of the Universe is either zero or very small. Quantitatively, 
there is the bound on Qeury, 


lQeurv| < 0.01. (4.8) 


We will discuss in what follows how this bound is obtained, and will simply use it 
for the time being. 

There are several independent observational methods to determine 2, and Q,. 
We discuss some of them in this and the accompanying book, while others are just 
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mentioned (see also Chapter 1). We quote here the current values, 
Qu = 0.315, Qa = 0.685 (4.9) 


with precision of about 5%. Hence, the present expansion of the Universe is deter- 
mined to large extent by dark energy, and to less extent by non-relativistic matter. 

The present energy density of non-relativistic matter is the sum of mass densities 
of baryons and dark matter, 


Ons = Qe a Dm, 
with 
Q, =0.050, Op = 0.265. 


At least two types of neutrinos have masses exceeding the temperature in the present 
Universe, see Appendix C. Neutrinos of these types are non-relativistic today. We 
discuss neutrino contribution Q, in Chapter 7, and here we assume for simplicity 
that it is negligible as compared to Qm. The number of electrons equals the number 
of protons by electric neutrality, so that! 


Me 


Qe & -Qs x 3-107. 


Mp 


Thus, the main contribution into Qu comes from dark matter. As we discuss in 
Chapter 9, dark matter is very likely cold. 

Spatially flat cosmological model with non-relativistic cold dark matter and 
dark energy with parameters close to (4.9) will be called? ACDM model. In what 
follows we will specify this model further, by adding more cosmological parame- 
ters. Let us make one qualification right now: unless we state the opposite, we will 
assume, in the framework of ACDM that p, is independent of time (cosmological 
constant = vacuum energy density). 

ACDM model is consistent with the entire set of cosmological data. This does 
not necessarily mean, of course, that this model is exact, or that no alternative 
models are possible. In any case, ACDM serves as an important reference point 
among numerous cosmological models. 

The relative contributions Q; to the right-hand side of the Friedmann equa- 
tion (4.1) are characteristic of the present epoch only, since prad, Pm, Pa aNd Peurv 
behave in time in different ways, namely prad X a~* (see (3.28)), pu x a`’ 
(see (3.18)), Peury « a? (see (4.2)), and, according to our assumption, pa is 


1 The first equality here is approximate, since there exist neutrons bound into nuclei and contribut- 
ing to Qs. This is not important for our purposes here. 

?In general, ACDM often denotes wider class of models. We will use this term in a narrow sense 
given in the text. This model is also dubbed in literature as “concordance model”. 
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independent of time. Thus, the Friedmann equation in the ACDM model can be 
written as follows, 


<2 
a = 8a ao 3 ao 4 ao 2 
(2) = z Ce fox (=) + Qrad (=) + Qa -+ Qeury (2) , (4.10) 
or 
H? = He[Qm(1 +z)? + Qraa(1 +F + Qa + Qeury(1 HF]: (4.11) 


A subtlety here is that the number of relativistic species, as well as the number 
of non-relativistic ones, is different at different cosmological epochs. In particular, 
neutrinos of mass my = 5 K are non-relativistic today, but they were relativistic at 
early stages of the evolution. This subtlety is not very important for this Chapter, 
but one should keep in mind that the Friedmann equation in the form (4.10) must 
be used with care. 

Another remark concerns dark energy. It cannot be excluded that pa in fact 
depends on time. In particular, one can consider dark energy with equation of state 
pa = wapa with wa Æ —1; in that case its energy density evolves with scale factor 
according to power law (3.41). Observational data show that wa belongs to the 
interval —1.25 < wa < —0.95. Most of the conclusions in this Chapter remain valid 
for such a dark energy, although formulas get more cumbersome for wa #4 —1. 
It is worth stressing that the question about time-dependence of pa is extremely 
important both for cosmology and particle physics, since it is directly related to the 
nature of dark energy: if pa = const, then the dark energy is cosmological constant, 
while time-dependence of pa would imply the existence of a new form of matter in 
Nature (e.g. fairly exotic scalar field dubbed quintessence). 


4.2. General Properties of Cosmological Evolution 


Let us discuss, first at the qualitative level, which contributions to the right-hand 
side of the Friedmann equation are most relevant at different cosmological epochs. 
In the first place, the contribution due to curvature has never dominated. Indeed, 
it follows from (4.8) and (4.9) that this contribution, if any, is presently small 
compared to contributions of both non-relativistic matter and dark energy. In the 
past, the contribution of non-relativistic matter was enhanced with respect to that 
of curvature by a factor ag/a = 1+ z, so that curvature was even less important. If 
the dark energy does not depend on time, the curvature contribution will be small 
in future as well: the spatial curvature will decrease as 1/a?, while p, stays constant. 

Talking about the future, let us note that all terms in the right-hand side of 
Eq. (4.10), except for Q4, decrease as a grows. Thus, the expansion rate of the 
Universe will be determined by dark energy, and the behavior of the scale factor 
will tend to the exponential law, (3.37) with pyac = Pa = PcQa. Of course, this 
conclusion holds for time-independent p, only. It is not known whether the latter 
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assumption is valid, so distant future of the Universe cannot be reliably predicted, 
cf. Sec. 1.4. 


Problem 4.1. Give examples of the evolution of dark energy leading to scenarios 
for the future of the Universe outlined in Sec. 1.4. In particular, find the bounds 
on maximum size and lifetime of the closed Universe assuming that dark energy 
instantaneously switches off right after the present epoch. Hint: Use the bound (4.8) 
and the values (4.9). 


We will be interested in the past of our Universe. Equation (4.10) shows that 
the major contribution to the present energy density comes from dark energy. This 
contribution, p,, has become important only recently. Before that, there was a long 
period of domination of non-relativistic matter (“matter domination”). At even 
earlier times, at sufficiently small a, relativistic matter was prevailing (“radiation 
domination”). Within the concepts developed so far in this book, the radiation 
dominated epoch began right from the cosmological singularity. This is precisely 
the Hot Big Bang picture. As we discussed in Sec. 1.7.1, this picture is certainly 
incomplete; nevertheless, we concentrate for the time being on the Hot Big Bang 
theory and study scenarios for the preceding epoch (inflation and alternatives) in 
the accompanying book. 

“Moments” when the regimes of evolution change are of considerable interest. 
Let us discuss them in some details. 


4.3. Transition from Deceleration to Acceleration 


We neglect the contributions due to relativistic matter and curvature and write 
Eq. (4.10) as follows, 


= Gp. — + osa?) ; 


This gives for acceleration 
An ao \3 
é = a—Gpe (29, Os (=) i 
3 a 
Since 2Q, > Qu and hence ä > 0, the expansion accelerates at the present epoch. 


In the past, at sufficiently large z = ao/a— 1, the expansion was decelerating, ä < 0. 
The transition from deceleration to acceleration occurred at 


ao E _ 20). 
Qac 7 Qu 


ar 
e= —1. 
(= 


i.e., at 
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50 


Fig. 4.1. Evolution of the time derivative of the scale factor inferred from the observation of 
standard candles and baryon acoustic oscillations [74]. Transition from decrease to increase as 
z decreases is precisely the transition from deceleration to acceleration. The theoretical curve is 
obtained for spatially flat Universe with h = 0.7 and Qa = 0.73. 


For Qu = 0.315, Q, = 0.685, we find numerically 
Zae Œ 0.63. 


Thus, transition from deceleration to acceleration occurred in the Universe pretty 
late. This transition is illustrated in Fig. 4.1, which shows the measured values of 
H(z)/(1+ z) = a(t)/ao. 


Problem 4.2. At what z did the contributions to energy density due to non- 
relativistic matter and dark energy become equal to each other? 


Since the dependence pw X a~? is quite strong, while p, depends on a weakly 
or does not depend on a at all, the transition occurred rather abruptly. Before 
the transition, the Universe expanded as a œ t?/3 (matter dominated epoch, see 
Sec. 3.2.1). 


Problem 4.3. At what z does the transition from deceleration to acceleration occur 
for dark energy with equation of state p = wp, w = const? For what value of the 
parameter w this transition would occur now? Give numerical estimate using the 
values (4.9). 


4.4. Transition from Radiation Domination to Matter Domination 


As we have already pointed out, in the Hot Big Bang theory the earliest epoch is 
radiation domination. The “moment” of the transition from radiation domination to 
matter domination (“equality”) is very important from the viewpoint of the growth 
of density perturbations: we show in the accompanying book that perturbations 
behave in a very different way at these two epochs. 
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A crude estimate for the moment of equality is obtained from Eq. (4.10) and 
estimates (4.6) and (4.9). Neglecting dark energy and curvature at this moment, we 
find that the contributions of radiation and non-relativistic matter were equal at 


ao Om 


~N 


Zeqgt1= ~ 10+. (4.12) 


Geq Qrad 
At that time the temperature in the Universe had the following order of magnitude, 
Teq = To(1 + 2eq) ~ 104 K ~ 1 eV. (4.13) 


Thus, equality occurred at rather distant past. 

The estimates (4.12) and (4.13) have to be refined. At temperature of order 
1 eV, not only photons, but also all three species of neutrinos are relativistic (see 
Appendix C). We see in Chapter 7 that neutrinos do not interact between themselves 
and with the rest of cosmic matter at this temperature. According to Sec. 2.5, their 
distribution functions are thermal nevertheless. We show in Chapter 7 that the 
effective temperature of neutrinos is 


4\¥83 
r= (=) T (4.14) 


where T} is the photon temperature. Everywhere in this book we identify the tem- 
perature in the Universe with the photon temperature, so that 


Ty =T. 


The energy density of relativistic neutrinos is given by the appropriately modified 
Stefan—Boltzmann law (see Chapter 5), 
TT? 


paso: nT, 
j 830 ” 


where the factor 3 corresponds to three neutrino species, factor 2 is due to the 
existence of both neutrino and anti-neutrino, one polarization each, and factor 7/8 
accounts for the Fermi statistics. Thus, the energy density of relativistic matter at 


the epoch of interest is 
21 4 NP] 7? 
ae (=) | T r4 (4.16) 


(4.15) 


Prad = Py + Pv = 30 4 


where the first and second terms in square brackets come from photons and three 
neutrino species, respectively. Hence, 


4 
Praa = 1.68p, = 1.68 (2) ype: (4.17) 


The energy density of non-relativistic matter is still given by 


ao 


Pu >= (Y Qube. (4.18) 
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We find that equality between relativistic and non-relativistic components hap- 


pens at 
1+ Zeg = = = 065 
and making use of (4.6) we obtain 
1+ zeg = 24-10" Oh, (4.19) 
which for Qw = 0.31 and h = 0.7 gives 
1+ Zeg = 3.5- 10°. (4.20) 
At that time, temperature is 
Teg = (14+ ey) Tp = 5.6 2h? eV (4.21a) 
Te =0.8 6V for h=0.7, Qu = 0.31. (4.21b) 


The expressions (4.19)-(4.21b) refine the estimates (4.12), (4.13) with account of 
three light neutrino species. 

Let us estimate the age of the Universe at equality. To simplify the calculation, 
we neglect non-relativistic matter before equality and note that during most of the 
evolution only photons and neutrinos were relativistic (the lightest of other particles, 
electrons and positrons, are relativistic at temperatures T > me = 0.5 MeV only). 
Therefore, we use formulas of Sec. 3.2.2 with effective number of degrees of freedom? 
g» obtained by comparing (3.30) with (4.16): 


4/3 
21/4 
ĝs=2+ ^ ( ) = 3.36. (4.22) 


Making use of (3.29) and (3.31) we estimate the age as 
on 1 Mh 
“o BH P2” 
eq eq 


(4.23) 


where, as before, M$, = Mp;/(1.66./g,.). Using (4.21a) and (4.22) we obtain for 
h=0.7, Qu = 0.31 


teg ~ 3.2-10°° GeV* = 2.1- 10's = 70 thousand yrs. (4.24) 


This time is of course much shorter than the present age of the Universe, to ~ 14 
billion years. We refine this estimate below, see Eq. (4.28). 

To conclude this Section we note that the definition of equality py = Praa 
is unambiguous. However, from the viewpoint of the cosmological evolution, the 
transition from radiation domination to matter domination is actually not a well 


3We reserve the notation g» for the parameter that determines the entropy density, see Sec. 5.2. 
The parameters g» and gx coincide at T 2 1 MeV, so we use the notation gẹ when discussing the 
Universe at so high temperatures. 
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defined moment in the history of the Universe, but the process whose duration is 
comparable to the Hubble time at that epoch, H F (in other words, comparable to 
the age teq). The ratio between energy densities of non-relativistic and relativistic 
matter depends on scale factor rather weakly, pyr/Praa X a, so this ratio does not 
change very much in Hubble time. Thus, the jump from the expansion law a œ t!/? 
to a x t?/3 at t = te, would be only a crude approximation. The relation (4.23) 
between the temperature Teq and age teq is approximate as well, as in our derivation 
of (4.23) we neglected non-relativistic matter at t < teg. 


Let us calculate the age of the Universe at temperatures higher than the neutrino masses 
but well below 100 keV. In view of the cosmological bound on the neutrino masses given in 
Sec. 7.2, Eq. (7.15), we are talking here about temperatures exceeding 0.2 eV. The relevant 
range of temperatures includes Teg as well as the temperature of recombination which we 
discuss in Chapter 6. 

Let us first write the relation between the age and scale factor: 


(a= [ 2-4 
Jy å Jo aH’ 


Starting from temperature of order 100 keV (a few minutes after Big Bang which we 
ignore), the composition of cosmic medium remains constant. Thus, the relation T œ at 
is exact, and we have 


t(T) = [ Ta (4.25) 


We neglect the dark energy contribution (this is a very good approximation at the epoch 


we consider) and write 
7 ee 
a \ Ty 2 \ Ta 


where Heq = H (Teq), the first and second terms in square barckets are contributions of 
non-relativistic and relativistic matter to the Friedmann equation, respectively; we make 
use of the fact that these contributions are equal to each other at T = Teg and that their 
sum at Teq is H? (Teq). Now the integral (4.25) is straightforwardly calculated, and we get 


(4.26) 


where 


2 T. 3/2 T, 172 4 
F(T/Ta)=3 (1+ 2) -2 (1+ 42) Fa 


The Hubble parameter at equality can be written as follows: 


8 1/2 
Hej = 2 G Z Gipu (Ta)| = vy 29m Ho(1 + nN 


which gives finally 


t(T) — Fi(T/Teq) 


= —_ Lu 4.27 
Ho VQ (1 + Zeq)?/? ( ) 
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We insert here F;(1) = (4 — 2\/2)/3, h = 0,7, Qm = 0,31, and obtain numerically 
teq = 52 thousand yrs. (4.28) 


This refines the estimate (4.24). 
Comoving horizon size at these temperatures is calculated in a similar way. We write 


We again make use of the relation (4.26) and obtain, upon calculating the integral, 


__v2 E FT) 
n(L) = = Hag T/T) - ovl + Zee) E (4.29) 


where 


We use this result in Sec. 6.4. 


Problem 4.4. Show that the expressions (4.27) and (4.29) reduce at T >> Teq and T K Teg 
to known expressions for the age and horizon size at radiation domination and matter 
domination, respectively. 


4.5. Present Age of the Universe and Horizon Size 


The fact that the cosmological expansion for fairly long time has been affected by 
dark energy leads to different values of the present age of the Universe and present 
horizon size as compared to the estimates (3.24) and (3.27). To calculate them, we 
neglect spatial curvature and relativistic matter: as we discussed in Sec. 4.1, the 
curvature is negligible at all epochs, while the relativistic matter is relevant at early 
times only, t S teg. Then Eq. (4.10) becomes 


£x2 
a\" a ao\ 3 
(ż) = H? fox ( ~ ) + ol, (4.30) 
where we made use of (4.3). Here 


Qutu =1, Qa >O. (4.31) 


The solution to Eq. (4.30) is 


a(t) = ao (= ) k sinh (Svat) i : (4.32) 


It is clear from this formula that the law of matter dominated expansion, a « t?/3, 
is restored at early times, while at late times the scale factor grows exponentially, 
as should be the case. 
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Fig. 4.2. Evolution of a = a(t) for spatially flat models. 


The present age of the Universe is now found from the following equation, 


Q 1/3 3 2/3 
( x) sinh (3 vmt) =1. 
Qa 2 


It is given by 


2. Looss [Oa 
to = — Arsinhy/——. 4.33 
0 3 ON Ho rsinn Gd, ( ) 


At Q, > 0 and Q,, — 1 we restore formula (3.24). For positive Q, the age exceeds 
2/(3Hp). To see this, we plot the scale factor as function of time for matter dom- 
inated model (Q, = 0) and ACDM model (Q, > 0) in such a way that they are 
tangential to each other (values of both a and å coincide) at the present time, i.e., 
at a = ao (equality of derivatives corresponds to one and the same value of the 
present Hubble parameter Ho = (@/a)o). Since the Friedmann equation for realistic 
model is given by (4.30) and the right-hand side for matter dominated model equals 
Hg(ao/a)*, the time derivative å is greater for matter dominated model at every 
a < a, so we come to the plot shown in Fig. 4.2. 

The distance along the horizontal axis from the point of singularity a = 0 to the 
point a = apo is precisely the present age; it is clear that the age is larger for the 
Universe with Q, > 0. We obtain from (4.33) 


to = 1.388 - 101? yrs for Qm = 0.315, Qa =0.685, h = 0.673. 


This age does not contradict the independent bounds discussed in Chapter 1. It is 
the presence of dark energy that removes the contradiction between the age of the 
Universe, calculated by using the measured value of the present Hubble parameter, 
and the bounds on this aged obtained by other methods. 


Problem 4.5. Consider an open model without dark energy (this model in fact is 
excluded by CMB data), in which Qm #0, Qeurv Æ 0, Qa = 0 and Qum + Qeury = 1. 
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Fig. 4.3. Age of the Universe t as function of redshift z in ACDM model with Q, = 0.7 h = 0.7. 


Find the present age of the Universe at given value of Ho. Calculate the numerical 
value for Qu ~ 0.3 (estimated from the studies of clusters of galaxies) and h = 0.7. 


Problem 4.6. Find the present age of the Universe for dark energy with equation 
of state p = wp, w = const. Give numerical estimates for w = —1.25 and w = —0.85 
with Qy = 0.3, Qa = 0.7. 


It is of interest to find the age of the Universe at different redshifts. We recall 
that 1 + z(t) = ao/a(t) and obtain from (4.33) 


2 1 QA 


t(z) = —— — Arsh , | ——. 4.34 
(2) 30x Ho rsh Tm(l +23 ( ) 


The function t(z) is plotted in Fig. 4.3. We emphasize that the formula (4.34) is 
valid only for z < 1000, when the energy density of radiation is negligible. 

The discussion of the cosmological horizon in ACDM model is not so instructive. 
Nevertheless, let us make the estimate. According to general formula (3.25), the 


present horizon size is 
dt 
lao = ao | ENA 
o a(t) 


Since a(t) x t?/3 at small t (see (4.32)), this integral is convergent at the lower limit, 
i.e., the cosmological horizon size is finite. For given Ho it is larger than the value 
2/ Ho obtained for the flat model with matter but without dark energy. Numerically, 
for Qum = 0.315, QO, = 0.685, the estimate is 


2 
lno =- -1.6 = 949 Gpe, h= 0.673. (4.35) 
Ho 
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Problem 4.7. Show that the horizon size is greater than 2/Ho in the model with 
matter and positive dark energy. Confirm the numerical value (4.35). 


To end this Section, let us make the following remark. The bound (4.8) for Qeurv 
together with the estimate (4.35) can be used to show that there are many regions 
of size ly o outside our horizon. Let us recall in this regard that in the Hot Big 
Bang theory these regions are causally disconnected. In any case, we cannot obtain 
any information on what is going on in these regions; as an example, relic photons 
traveled the distance slightly smaller than ly, o since the recombination epoch. 

Clearly, there is an infinite number of regions in question in open and flat models, 
so we are talking about closed Universe, 3-sphere.* It follows from the definitions 
(4.2), (4.3) and (4.4) that the radius of this sphere ao is related to Qeury as follows, 


1 


SS = AG Peurah: (4.36) 
ao 


Comparing this with (4.35) and using the bound (4.8) we find 


1 

2 Zo. 

lao  3.24/Reurv] 
Thus, the radius of the Universe is larger than the horizon size. This becomes even 
more striking if we calculate the number of regions like ours. The latter is equal to 
the ratio of the volume of the 3-sphere 277a3 and the volume of a region of radius 
lao: 

2ra? ( ao I 
N x ———2— = 4.7 | — ) >150. 4.37 

Nz o Tno (1.37) 
Hence, observational data show that we see less than 1% of the whole Universe. 
In the accompanying book, we give theoretical arguments suggesting that Qeurv 
is many orders of magnitude smaller than the observational bound (4.8), i.e., the 
number of regions outside our horizon is by many orders of magnitude larger than 
the bound (4.37). 


4.6. Brightness—Redshift Relation for Distant Standard Candles 


Let us discuss one of the important methods of determination of cosmological 
parameters, such as Ho, Qm, Qa and Qcury. This method is also capable of discrim- 
inating between the cosmological constant (vacuum energy) and time-dependent 
forms of dark energy. The method makes use of the simultaneous measurement of 
redshift z and visible brightness of “standard candles” which are at distances com- 
parable to the horizon size. These standard candles are very luminous objects whose 


4We assume here that the Universe is homogeneous and isotropic both inside and outside our 
horizon. 
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absolute luminosity is assumed to be known. Among the objects used as standard 
candles? are supernovae of type Ia (SNe Ia). 

Let us find the relation between redshift and visible brightness of a source with 
absolute luminosity (energy emitted in unit time) L. Although the analysis that 
follows (but not concrete results!) is straightforwardly generalized to time-dependent 
dark energy, let us assume, for the time being, that dark energy density p, does 
not depend on time. It is useful to keep spatial curvature not equal to zero and 
ignore the bound (4.8). Let us choose for definiteness open cosmological model with 
x = —l and Qeury > 0. The flat model is obtained in the limit Qeur, — 0, or, 
equivalently, ao — co, see (4.36). 

Let us use the form (2.10) of the metric, 


ds? = dt? — a?(t)[dx? + sinh? x(d0? + sin? 0d¢7)]. (4.38) 


As usual, the coordinate distance betwen the source emitting light at time t; and 
observer at the Earth at time to equals 


to 
y= I LN (4.39) 
t: 


Let us first find the relation between the coordinate distance and redshift z of the 
source. To this end, we use the Friedmann equation in the form (4.10), and neglect 
radiation. Changing the integration variable in (4.39) from t to 


we obtain 


Zz dz! 
I= [ TODEN 


Using Eq. (4.10) we cast this integral into 


” dz! 1 
x(z) = Í a (4.40) 
o aoHo /Ou(z! + 1)3 +O, + Qeure(2’ + 1)? 


This integral cannot be evaluated analytically, but its numerical calculation is easy. 


5 As everywhere in this book, we do not discuss astrophysical and observational aspects of this 
problem. In particular, we leave aside the issue about the nature of SNe Ia, the question of why 
they are candidates for standard candles, etc. 
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According to (4.38), the physical area of the sphere crossed by photons today is 
S(z) =4ar?(z), (4.41) 

where 
r(z) = ao sinh y(z). (4.42) 


The number of photons crossing unit surface at the observer’s position is inversely 
proportional to S, while the energy of each photon differs from the energy at emis- 
sion by the redshift factor (1+ z)~1. The same factor additionally enters the expres- 
sion for the number of photons crossing unit surface in unit time, since the time 
intervals for the source and observer differ by factor (1 + z)~!. The latter point 
can be understood as follows. In conformal coordinates (7,x) photons behave in 
the same way as in Minkowski space; see Sec. 2.3. Therefore, in these coordinates 
the time intervals between emission and absorption of two photons are equal to 
each other, dne = dno. This gives the relation between the physical time intervals, 
dto = (1 + z)dte. 
Thus, the visible brightness (energy flux at observer’s position) equals 


L 
J = —— r. 4.43 
(1 + z)?2S(z) ( ) 
This is the desired expression for the brightness—redshift relation for a source whose 
absolute luminosity L is assumed to be known. 
Let us introduce photometric distance rpn in such a way that the relation between 
L and J is formally the same as in Minkowski space-time, 
= b 
Anr?,, ` 
We find from (4.43) 
Tph = (1 + z) - r(z), (4.44) 


where r(z) is given by (4.42). 

At first sight, the relation (4.43) involves five cosmological parameters: Ho, ao, 
Om, Qa and Qcurv. In fact, the number of independent parameters is three in view 
of the relations (see (4.2) and (4.4), (4.5)) 


Qu tO, + Qeury = 1 (4.45) 
and 
Cn (4.46) 
curv | — a2 H? & s 


Note that at z < 1, one can neglect z’ in the integrand in (4.40), then y(z) = 
z/(aoHo) and r(z) = aox(z), so we get back to the Hubble law r(z) = Ho ‘z 
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Fig. 4.4. Dependence of Hor(z) on redshift z for various cosmological models. 


In that case, the leading order expression for brightness coincides with the usual 
formula 


= ee ZE 
Let us now discuss the general case. It is clear from (4.40)—(4.46) that the three 
independent parameters enter the brightness-redshift relation in a non-trivial way. 
In principle, all of them may be determined by measurements in a wide range of z. 
This is illustrated in Fig. 4.4. 

To understand what is shown in Fig. 4.4 we note that the dependence on cos- 
mological parameters enters the formula (4.43) through the function r(z). If we 
measure r(z) in Hubble units Hj‘, then 


1 
Hor(z) = Wan sinh y(z), (4.47) 
5 V Peurodz' 


x(z) = 73 2° 
0 Qy(1 + 2’) + Qa + Qeurv(1 +z’) 


Hence, the right-hand side of (4.47) does not explicitly depend on Hp; it is this 
quantity that is shown in Fig. 4.4. 

Let us first discuss black and dark-gray curves in Fig. 4.4, which correspond to 
spatially flat Universes with: Qu = 0.27, Q, = 0.73 (black curve) and Qy = 1, 
Qa = 0 (dark-gray curve). We obtain formulas for the flat model from the general 
expression (4.42) by taking the limit ap > c0, Qeury > 0. In this way we get 


E dz' 
r(z) = ; ouro = 0, (4.48) 
Ho 0 Q(z! + 1)2 +O 
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Fig. 4.5. Degeneracy in the parameter space (Qm, Qa). The cases (Qm = 0,94 = 0.55) and 
(Qm = 0.6,Q, = 0.85) correspond to open and closed models, respectively, with Qeury = 1 — 
Om — Qa. Note that the horizontal scale here is logarithmic, unlike in Fig. 4.4. 


with Qw + Qa = 1. It is clear that the larger is Q, (and, accordingly, the smaller is 
Qu), the faster the function r(z) grows with z; distant supernovae are dimmer in 
ACDM model as compared to the flat model without dark energy. It is this property 
that have been observed [28, 29]. Figure 4.6 shows one of the early sets of data [64], 
while Fig. 4.7 illustrates uniqueness of the interpretation in terms of dark energy 
with enlarged dataset [65]. 

Now, black and light gray lines in Fig. 4.4 correspond to ACDM model and 
open model without the cosmological constant. They differ already at moderate 
z. Therefore, the model with Q, = 0, Qeury = 0.7 is also inconsistent with data. 
Overall, data on SNe 1a reject models without dark energy, see Figs. 4.6—4.9. 

This is probably the strongest argument for dark energy. We stress, however, 
that there are other, independent arguments. Namely, we mentioned already the 
argument based on the extrapolation of the mass estimates of clusters of galaxies 
to the whole Universe (giving Qm ~ 0.3), together with CMB bound on spatial 
curvature. We also presented the argument based on the age of the Universe. Other 
arguments come from the analysis of CMB and large scale structure; some of them 
are discussed in the accompanying book. 

Let us now turn to Fig. 4.5. We present it here to illustrate the degeneracy 
in parameters: models with very different parameters give very similar results at 
moderate z; this range of z is of particular interest, since objects at large z are dim, 
and hence difficult to observe. To see what is going on, let us find the first correction 
to the Hubble law at small z. We make use of the relation Q, = 1 — Oy, — Qeury 
and write to quadratic order in z 


1 “30 + 2Qeurv) 
Gallo zZ 4 M curv 


x(z)= 
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Fig. 4.6. Hubble diagram for SNe Ia: early data [64]. The upper panel shows the brightness dis- 
tribution of supernovae (appropriately corrected). The lower panel illustrates the incompatibility 
of observations to the CDM model with spatial curvature (Qm = 0.2, Qa = 0, Qcurv = 0.8, dotted 
line) and flat CDM model (Qm = 1, Qa = 0, Qcurv = 0, dashed line). The black line is the pre- 
diction of the ACDM model with Qm = 0.28, Qa = 0.72, Qeury = 0.0 which is consistent with the 
data. The notation on vertical axis is related to brightness measure in astronomy, apparent magni- 
tude. The difference (m— M) is related to photometric distance by m— M = 5 logo (rph /Mpc)+25. 
The larger (m — M) the dimmer the object. 


To this order r(z) = aox(z) + O(2?), so 


2 
r(e) = Fo |2 L(y + Neuro): (4.49) 
The second term in the right-hand side is precisely the correction we are concerned 
about. It is clear from (4.49) that this correction depends only on the combination 
(30 .,+2Q curv) or, in terms of Qu and Q,, on the combination (Qu — 294), but not 
on Qu and Q, separately. It is this property that is responsible for the degeneracy at 
small z. To study the degeneracy at moderate z one has to use higher order terms in 
z in Eq. (4.49). It turns out that the terms depending on the combination ((Qy,—-2Q,) 
partially cancel at the orders z? 
Uncompensated contribution of order z? depends on another linear combination, 


and z® in the interesting range of parameters. 
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Fig. 4.7. Plots [65] (2004) illustrating different possible interpretations of observations of SNe Ia. 
They show the deviation from the prediction of a model of curved empty Universe (Qm = Qa = 
0, Qcurv = 1; recall that such a Universe expands at constant speed, @ = 0). Among cosmolog- 
ical models, considered are flat ACDM (Qm = 0.27,Q, = 0.73) and flat CDM model without 
dark energy (Qm = 1,Q, = 0). Model with evolution is the model in which absolute lumi- 
nosity of SNe la decreases like z~! (in flat CDM cosmology). Models with non-standard inter- 
galactic medium are models with “dust” absorbing the supernovae light; the “dust” densities are 
p(z) = po(1 + z)%, where a = 3 (dash-dotted line) and a = 3 at z < 0.5 while a = 0 at other 
z (thin black line). Data at upper panel are from the Hubble telescope (bullets) and terrestrial 
telescopes (diamonds). At the lower panel these data are combined and binned in z. We note that 
independent observations of SNe Ia [66] (2005) give results consistent with those shown here. More 
recent supernovae catalogs are also consistent with the data shown here, see Fig. 4.8. 


(20,,—-Q,). Hence, the observations studying the photometric distances at moderate 
z are sensitive to the latter combination. The data show that this combination is 
close to zero, while there is approximate degeneracy along the orthogonal linear 
combination. Therefore, the allowed region of parameters is stretched along the line 
20, — Qa = 0. This is seen in Fig 4.9 [67]. 


Problem 4.8. Show that to the order z? the degeneracy in parameters is removed, 
ie., r(z) given by (4.42) depends in a non-trivial way on all three parameters 
Ho, Qu, Qa. Show, nevertheless, that in the interesting region of cosmological 
parameters there remains approximate degeneracy at moderate z along the line 


29m — OQ, =0. 


There is nothing surprising in the degeneracy in parameters. It is clear that the 
lowest in z correction to the Hubble law can depend only on the present values of 
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Fig. 4.8. Hubble diagram for distant objects [75] (2011). Solid curve is the best fit for spatially 

flat ACDM model. See Fig. 13.13 for color version. 


the Hubble parameter and acceleration parameter. We define the latter as follows,® 


1 /a 
eed) 4. 
qo H? (5). ( 50) 


By measuring one parameter go, one can determine only one combination of Qw 
and Q4, hence the degeneracy. 

Regarding Fig. 4.9, let us make a general comment. We often show regions in a 
parameter plane allowed by one or another data. If we do not state the opposite, 
three regions embedded into each other correspond to regions allowed at lo, 20g and 
30 confidence level (assuming Gaussian distribution for relevant quantity), i.e., at 
the confidence level of 68.3%, 95.4% and 99.7%. 


Problem 4.9. Find the lowest in z correction to the Hubble law, i.e., the function 
r(z) at quadratic level in z, in terms of Ho and qo. Do not use the Friedmann 
equation. Show that after using the Friedmann equation, this expression coincides 


with (4.49). 


It is traditional to use deceleration parameter, which differs by sign from the acceleration param- 
eter (4.50). We think that using deceleration parameter for accelerating Universe does not make 
much sense. 


4.6. Brightness—Redshift Relation for Distant Standard Candles 87 


2.0 = 
Union08 SNe 
ee Union08 SNe w/ sys 

1.54 | 
5 

1.04 | 

ost i | 

ool a SN = 

0.0 0.5 1.0 


Qn 


Fig. 4.9. Regions in the plane of cosmological parameters (Qm, QA), consistent with observa- 
tional data on SNe Ia [67]. Dotted lines show contours obtained with proper estimates of possible 
systematics. 


The approximate degeneracy in parameters makes it difficult to determine Qw, 
Q, and Qeury by studying standard candles only. On the other hand, CMB tem- 
perature anisotropy gives strong bound on Qeurv: |Qeurv| < 0.01. Making use of 
this bound, one can determine Qw and Q, from the observations of SNe Ia. This is 
shown in Fig. 4.9, where the line Qeury = 0 is denoted as Qiot = 1; it is seen that 
supernovae observations give (0.23 < Qu < 0.39, 0.77 > Q, > 0.61) at 95%CL. 

To conclude, the observations of SNe Ia together with CMB and BAO measure- 
ments are one of the main sources of information on dark energy. The fit to existing 
cosmological data gives the following determination, 


Qm = 0.31540-918 | Qa = 0.68549-017 


at 68% CL. 

Similar observations with better precision and statistics and at larger z will most 
likely make it possible to find the dependence of dark energy density on time (or 
establish strong bounds on this dependence). The existing data are consistent with 
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Fig. 4.10. Regions in the plane of parameters (Qm, w) allowed by observations of CMB anisotropy 
(orange, dotted lines), SNe Ia (blue, solid lines) and baryon acoustic oscillations (green, dashed 
lines). Shaded regions correspond to the combined analysis of all these data [75]. Small, middle 
size and large regions are allowed at confidence levels of 68.3%, 95.4% and 99.7%, including both 
statistical and systematic uncertainties. 


time-independent dark energy (i.e., dark energy equation of state p = —p), and 
under the assumption of time-independence of the parameter w of the dark energy 
equation of state p = wp, these data give (see Fig. 4.10 and Fig. 13.6 on color pages) 


—1.2 < w < —0.8. (4.51) 
Refining this result is a very important task for future observations. 


Problem 4.10. Generalize formulas of this Section to the case of dark energy with 
equation of state p = wp, w = const. Taking Qeury = 0 and Qy = 0.27, draw plots 
of r(z) for w = —2, w = —1.5, w= —1, w = —0.75 and w = —0.5. Making use of 
Fig. 4.9 show that existing data are indeed capable of determining w at the level of 
accuracy given in (4.51). 


The existing observations are consistent with the assumption that dark energy is 


the cosmological constant. Figure 4.11 shows the present physical distance r (4.48) 
as function of redshift in ACDM model. 


4.7. Angular Sizes of Distant Objects 


An important characteristic of an extended object (e.g. galaxy) is its angular size. 
In this regard, a useful concept is angular diameter distance D, that relates the 
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rt, Gpe 


Fig. 4.11. Present physical distance (4.48) to object with redshift z in ACDM model with Q4 = 
0.7, h = 0.7. 


absolute size of an object d to angle A0 at which this object is seen today, 


d= Da(z)- A9, 


where z is the redshift of the object. To find the expression for Da(z) let us recall 
again that in conformal coordinates photons behave in the same way as in Minkowski 
space-time. Therefore, the coordinate size of an object is related to its coordinate 
distance from us y and angular size A@ by the relation 


deonf = sinh x - A9. 


The physical size of the object emitting photons at time t; is equal to 


ti ‘ 
d = a(t;)deonf = atts) -ag sinh y - A0. 
ao 
Making use of (4.42) we obtain 
1 
Da = , 
(2) = zro) 


where r(z) is given by formulas of Sec. 4.6. 

Angular diameter distance increases with z relatively slowly at moderate z, see 
Fig. 4.12. On the other hand, photometric distance (4.44) rapidly increases with 
z, so galaxies become more and more dim. At large z, the large distance from the 
Earth to the galaxies shows up not in the smallness of their angular sizes but in 
their low surface brightness (visible brightness of a region of unit angular size). 

One way to measure the angular diameter distance Da(z) at various z (more pre- 
cisely, a combination of Da(z) and the Hubble parameter H(z)) is to make use of the 
baryon acoustic oscillations, BAO. We consider them in detail in the accompanying 
book, and here we give qualitative treatment. To begin with, we point out that at 
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z 


Fig. 4.12. Angular diameter distance as a function of redshift in ACDM model (Qm = 0.27, 
QA = 0.73, ere = 0). 


very early hot epoch, the perturbations in dark matter and baryon-electron-photon 
component coincide in space and, modulo a numerical factor, in amplitude (adia- 
batic perturbations): 


dpcpM(X) = 3 dpBy(x) 
PCDM 4 pay ` 


Later on, but still before recombination, the baryon—photon perturbations propa- 
gate with the sound speed us = \/Op/dp & 1/V3 (see Chapter 5), while the sound 
speed in dark matter vanishes, since its pressure equals zero. Let us imagine that 
initially there is overdensity of dark matter and baryons in some place in space. By 
the time of recombination t,, the excess in dark matter remains at the same place, 
wile the excess of baryons (and photons) moves from this place (as a spherical sound 
wave, if the initial excess is spherically symmetric) to distance 


t 
r” dt 
= alt) f Us ——. (4.52) 
o a(t) 

To obtain this formula, we recall that the coordinate distance dx that a sound wave 
travels in time interval dt is given by 

a(t)dx = u,dt. 
The quantity r,(t,) is called acoustic horizon at recombination; it is reliably calcu- 
lable once Qepy and Ng are known. The corresponding present size is 

ao 


altr) 


= 147 Mpc. 


Ts 


4.7. Angular Sizes of Distant Objects 91 


After recombination, baryons no longer interact with photons, pressure in the 
baryon component and hence sound speed in it fall practically to zero, and the 
baryon perturbations get frozen out in space. As a result, the overdensities in 
baryons and in dark matter are separated by the distance rs. The corresponding 
physical distance increases in time due to the cosmological expansion, 1,(t) x a(t). 
Both dark matter and baryons participate in galaxy formation, so our discussion 
shows that there must be an additional correlation in the density of galaxies at 
distance r,(t). This feature in the galaxy correlation function has been discovered 
indeed [76].” 
Thus, there is a standard ruler in the Universe at every z, whose size is 


1+ 2; 


ralz) =1Ts(tr) ca 


The measurement of an angle at which it is seen enables one to determine 
the angular diameter distance D,(z). In practice one measures the combination 
[D2(z)H~*(z)]/8, which is the geometric mean of distance scales in directions nor- 
mal to the line of sight (Da) and along the line of sight (H~+). These measurements 
cover today a fairly large region z < 1; see Fig. 4.13. 

Presently, BAO is an important instrument for studying the dynamics of the 
cosmological evolution at late epoch, and in particular for analyzing the dark energy 
properties. 


400 - — WMAP ACDM - 
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Fig. 4.13. The measured values of Dy = [z(1 + z)2D2(z)/H(z)]!/%, according to the data on 
structures [77]. Theoretical prediction is made in the ACDM model, the gray band corresponds 
to the cosmological parameters inferred from CMB data; rs here denotes the present size of the 
sound horizon at recombination and r, ¢;q is its value obatined from CMB data. 


"In Fourier space, it yields oscillations in the power spectrum, hence the name. 
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4.8. *Quintessence 


Cosmological constant (vacuum energy density) is not the only possible reason of 
the accelerated expansion of the Universe at the present epoch. Nature of dark 
energy is one of the major problems of contemporary natural science. No wonder, 
numerous hypotheses have been put forward in this regard. One of these hypotheses 
is the existence of “quintessence”, spatially homogeneous field whose energy plays 
the role of dark energy [68-70]. Unlike the cosmological constant, quintessence is a 
dynamical field, and its energy density depends on time. In terms of the effective 
equation of state p = wp this means that w # —1 with, generally speaking, time- 
dependent w. 

Quintessence is often (but not always) associated with a scalar field. As a con- 
crete example, we consider one class of models of this sort later in this Section, 
but before that we study general properties of scalar field dynamics in expanding 
Universe. These results will be useful also in our discussions of other topics. Under- 
standing the dynamics of scalar field is particularly important in the context of 
inflationary theory. 


4.8.1. Evolution of scalar field in expanding Universe 


Let us consider a theory of real scalar field with action 
1 V 
S= J tava = ftev Fa ð pð — V (p), (4.53) 


where V(ẹ) is the scalar potential. Let us stick to spatially flat Universe whose 
metric has standard Friedmann-Lemaître-Robertson-Walker form (2.22) with scale 
factor a(t) being a known function of time. 

The equation for the scalar field is obtained, as usual, by varying the action (4.53) 
with respect to y, and has the form 


1 OV 
——28,(V/—g9"" 8,0) = -—. 4.54 
Ta (v —=99"" vp) De (4.54) 


Let us specify to spatially homogeneous (independent of coordinates) scalar field 
p(t) in the background metric (2.22). In that case, Eq. (4.54) reduces to 
OV 
p+ 3Hy = -—. (4.55) 
dp 
Equation (4.55) formally coincides with the equation for the classical mechanics of a 
“particle” with coordinate y in the “potential” V (y), which experiences friction with 
time-dependent friction coefficient H. Depending on the strengths of the driving 
force and friction, two regimes are possible: (1) fast roll regime when Hp « V’ 
(prime denotes the derivative with respect to y), friction is weak, and the “particle” 
rapidly rolls down to the minimum of the potential V (4); (2) slow roll regime when 
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friction is strong, and the “particle” barely moves. Let us begin with the second 
regime. In that case 
He~V'. (4.56) 


During the Hubble time H~! the field changes by 


This change is small compared to the field itself, dp < y, provided that 


1 


A «x H’. (4.57) 
p 


For power-law potentials like m?y? or Ayt one has V’ ~ V~! 


for slow roll is 


, so the condition 


y 
z< H?. (4.58) 


Thus, in the case of power-law potential, the condition (4.58) is necessary for slow 
roll, at which the value of the field remains practically constant during the cosmo- 
logical evolution. Once this condition is violated, the field rapidly rolls down to the 
minimum of V(y). 


Problem 4.11. The condition (4.58) is not sufficient for slow roll. The second 
necessary condition is that the first term on the left-hand side of Eq. (4.55) does not 
exceed the second term, 6 L Hy. The latter condition ensures that velocity does not 
change much in one Hubble time, 5g ~ HT! < ¢, so the velocity remains small 
during the evolution. Find the second slow roll condition in terms of the potential 
V(y) and its derivatives, and H(t) and its derivatives. Simplify this condition in the 
case of power-law dependence of a(t) on time (like a x t?/3, matter domination). 


Problem 4.12. Show that for the power-law dependence of the scale factor on time, 
a(t) = t*, a > 1/3, the general solution to (4.55) in the approximation V’ = const is 


g=y+C--#)+d- i O! (4.59) 


t 


where p; is the initial value of the field, pi = (ti). In the case p(ti) < H(ti) y(t), 
find the constants C and d in terms of pi and ġ(ti). Show that in this case the 
third term in (4.59) is always small. Find the values of t at which the second term 
in (4.59) is small compared to the initial value pi and show that for a power-law 
potential this time is in accordance to (4.58), and in general case to (4.57). Show 
that in this time interval the relation (4.56), as well as p ~ H¢, hold. 


Although we will not need in this Section to know the behavior of the scalar 
field near the minimum of the scalar potential, let us consider how y(t) approaches 
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this minimum. Let the minimum of V(y) be at p = 0, and near the minimum the 
potential be 


m 
Vo) = y. (4.60) 
Then Eq. (4.55) near the minimum has the form 
seg Os 2 
G+3-p+m p=0. (4.61) 


To analyze equations of this sort, it is convenient to get rid of the friction term 
proportional to ù and cast the equation into the form of the oscillator equation with 
time-dependent frequency (cf. end of Sec. 2.3). In our case the change of variables 
is given by 


y(t) = way - x(t), 


where y is a new unknown function which obeys 


z 34 3a? 
Note that 
2 
a a 
—_~ —=H", 
a ae 


It then follows from (4.62) that at m? < H? the mass term is negligible. In that 
case the potential (4.60) obeys the condition (4.58), and we come back to the slow 
roll regime. 


Problem 4.13. Obtain the results of the previous problem starting from Eq. (4.62). 


We are now interested in the opposite regime, m? >> H?. In this case, we can 
neglect terms of order H? in parenthesis in (4.62), so the solution has the form 
x = const -cos(mt+ 3) where ( is an arbitrary phase. Hence, at m? >> H? the field 
relaxes to the minimum in the following way, 


_cos(mt + 8) 


p(t) = px ae)” (4.63) 


where Y, is some constant. The field oscillates near the minimum with decreasing 


amplitude. Note that the relative change of the amplitude in one period of oscilla- 
tions is of order 


so it is small in the case under study, m > H. 
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The solution (4.63) can be found also from energy considerations. In the case of 
free massive scalar field, the action (4.53) in the Universe with metric (2.22) takes 


the form 
1 1 m? 
= 4,,.,3,{+.2 + 7a yo Mo 
g= daa (52 542 iv) ot). 


If the scale factor were independent of time, the homogeneous field would have 


conserved energy, 
1 m? 
E= | d&x-a3-( =o? + —y?). 
J x-a (3e FY 


The solution for the field equation would oscillate with frequency m, i.e., yp œX 
cos(mt + 3). If the scale factor slowly (adiabatically) grows in time, the oscillatory 
behavior of solutions persists, and energy is still (approximately) conserved, as we 
know from classical mechanics. The latter property means that 


1 2 
a? (t) (58 + =) = const. (4.64) 


~3/2  ie., we come back 


This tells that the amplitude of oscillations decreases as a 
to the solution (4.63). Note that the approximate conservation law (4.64) can be 
interpreted as (approximate) energy conservation in comoving volume. Note also 
that for fast changing a(t), there is no conservation of energy at all; energy is 
not conserved in time-dependent backgrounds (in this case a(t)). It does not make 
sense to talk about energy (including energy of the gravitational field) of the entire 
Universe; there is no such an integral of motion in General Relativity.® 

To end this Section, let us find the energy-momentum tensor of the scalar field 
in different regimes. The general expression for energy-momentum tensor in the 
theory with action (4.53) is 

2 6S 


w= TS gav On POP — Iur L. (4.65) 


For homogeneous scalar field and potential V (p) we have for non-zero components 
in the locally-Lorentz frame (i.e., setting the local metric guv in (4.65) equal to nuv) 


1, 1. 
Tay =5P Vl) = 0 Ty = (3P -VO ) Oy = poby (466) 
The estimate (4.56) gives in the slow roll regime 
. ye? 
$? ~ H2 < Vv’ P, (4.67) 


8 There are certain cases in which one can define total energy (including energy of the gravitational 
field) in the framework of General Relativity. One special case is asymptotically flat space-time. 
So, the notion of energy (mass) of a gravitating body, away from which space-time is Minkowskian, 
is well-defined. 
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where we used (4.57) to obtain the inequality. Power-law potentials obey V’yp ~ V, 
so (4.67) gives 


p? «V. 
Hence, in the slow roll regime 


Po =~ —pp ~ V (9), (4.68) 


i.e., the equation of state is approximately the same as for vacuum, p ~ —p (although 
it is clear from (4.66) that one always has p > —p). 
To discuss the regime of fast oscillations about the minimum of the scalar poten- 
tial, we use (4.60) and (4.63) and obtain 
2,42 2,2 
M Py 1 _ mpl 
Too = o PO Tij = ~~ Ga Cos(2mt + 2) - dij, 
where we again used inequality H < m. Thus, energy density and pressure, averaged 
over a period of oscillations, are 
29 
2 m'p, 1 = 
Too = Pe = ~~?) ` a3(t)’ Ti; = Po Oi; =0. (4.69) 
We conclude that average energy-momentum tensor of the coherent scalar field 
oscillations coincides with the energy-momentum tensor of non-relativistic matter: 
pressure is zero, while energy density decreases as a73. As we already mentioned, 
the latter property reflects the energy conservation in comoving volume; see (4.64). 
From the quantum field theory viewpoint, homogeneous oscillating scalar 
field (4.63) is to be viewed as a collection of free particles of mass m, all at zero 
spatial momentum (at rest). The number density of these particles is given by 


p_ 1. 4 1 


Zm 2? BH 


At any number density, n(t) decreases as a~. The fact that pressure vanishes has 
natural interpretation in this picture: particles at rest do not produce any pressure. 


4.8.2. Accelerated cosmological expansion due to scalar field 


Accelerated expansion of the Universe at the present epoch may be explained by 
introducing scalar field y (quintessence) with action (4.53) and choosing the poten- 
tial V(w) and the present value of the field y in such a way that the evolution of 
the field occurs in the slow roll regime. Furthermore, one should assume that the 
field y is spatially homogeneous; initial data of the latter type are natural in the 
framework of inflationary theory, so the latter assumption is not particularly strong. 

The effective equation of state in the slow roll regime is pp + —py, see (4.68). 
Thus, the Universe indeed undergoes accelerated expansion, if the scalar field gives 
dominant contribution into the energy density. Let us find the conditions under 
which the slow roll regime indeed holds. Let us assume for definiteness that near 
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the present value of y the scalar potential may be approximated by power law, 
V(y) x y! where |k| is not very large. Then the slow roll condition takes the 
form (4.58). Since energy density in the Universe is mostly due to the scalar field, 
the Friedmann equation is 


2 87 87 


where we used (4.68). By combining (4.58) and (4.70) we obtain 
p > Mpi. (4.71) 


This is the form of the slow roll condition for power-law potentials and the Universe 
dominated by the scalar field itself. 

Despite the fact that the value of the scalar field is extremely large, the value of 
the potential V (p) must be very small, 


V(¢~) ~ Pe 


(more precisely, V(y) = 0.7p-). This means for power-law potential (and also for 
more general potentials) that it must be extremely flat. As an example, in the case 
V(y)= me 9? the mass must be extremely small, 


< Vee Pont —33 
Š Mp 0 eV, 


while in the case V (p) = Ayt the coupling must be tiny, 
X< Iom, 
Thus, the quintessence idea works for very exotic scalar potentials only. 


Problem 4.14. Show that for power-law potentials the condition (4.71) ensures 
p< Ho. Thus, the second slow roll condition (see Problem 4.11) is satisfied auto- 
matically. 


Quintessence models can, at least in principle, be tested by observations. In these 
models, the relation p = —p for dark energy is approximate. Dark energy density 
depends on time, so the cosmological expansion is different from that in the model 
with cosmological constant. 


Problem 4.15. In quintessence model with potential V (p) = me p, find the present 
value of the dark energy equation of state parameter w as function of the present 
value (to) = po. Choose the present value po in such a way that wo = 0.95, 
and find w(z) as function of redshift at 2>z> 0. Take the values Q, = N, = 0.7, 
Qu = 0.3 and make use of the fact that the scalar field changes slowly at the present 
epoch. 
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Problem 4.16. Under conditions of the previous problem, and with wo = 0.95 find 
numerically the photometric distance rpn(z) as function of redshift (see Sec. 4.6). 
Draw a plot analogous to Fig. 4.4 and compare it to the plot with Qu = 0.3, Qa = 0.7 
and time-independent pa. 


Finally, let us note that the future of the Universe in quintessence models is, 
generally speaking, different from the future of the Universe with cosmological con- 
stant, see also discussion in Sec. 1.4. This difference is particularly strong in models 
where the field y(t) rolls down to negative values of the potential V(y): in that case 
expansion will eventually terminate, and the Universe will recollapse to singularity. 


Problem 4.17. Assuming that today p = po > Mp1, study the future of the Uni- 
2 

verse in a model with potential V (p) = 4-y? + ey, where |e| « m?Mp; is a small 

parameter, and m is such that V (po) ~ pe. Hint: Use the results of Sec. 4.8.1. 


We stress that quintessence models do not, generally speaking, solve the cos- 
mological constant problem. They only divide this problem into two parts: one is 
the question of why the “true” cosmological constant (vacuum energy) is zero, and 
another is the question of why the energy density of quintessence is so small. The 
first question cannot in principle be answered in quintessence models; the second 
has to do with naturalness of the scalar potential. Furthermore, there is a new ques- 
tion in quintessence models: what mechanism ensures the “correct” present value 
of the scalar field? A possible answer to the latter question is given in models with 
suitable scalar potentials [69-73]. A subclass of these are “tracker” models [70, 73] 
which we now discuss. 


4.8.3. Tracker field 


Let us consider a subclass of quintessence models, where the potential has the form? 


M4 


? 
ny” 


V(¢) 


where M is a parameter of dimension of mass, and n > 2 is a numerical parameter. 
At radiation or matter domination, when a(t) œ t, the field equation (4.55) takes 
the form 


., 30, Mr 
pr FY = prt = 0. (4.72) 
It has special, “tracker” solution 
p(t) = CMY, (4.73) 


9This potential is rather exotic from particle physics viewpoint. 
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where 
2 
v= —, 
n+2 
and C = C(n,a) is determined from Eq. (4.72). 


(4.74) 


Problem 4.18. Show that (4.73) is indeed a solution to Eq. (4.72). Find C(n, a) 
entering (4.73). 


Solution (4.73) is a power-law attractor: if at initial time t; the field y; is smaller 
than solution (4.73) taken at the same time, y; < vy”) (ti), then the driving force 


M4 
F(y) =-V'(y) = yT 


is larger than the driving force for the solution (4.73), 


F(p(t)) > F(p (6). 


Hence, the solution with y; < y“")(t;) catches up the tracker solution (4.73). Con- 
versely, a solution with y; > vy") (t;) rolls slower than the tracker solution, so the 
former also approaches the latter in the course of evolution. This precisely means 
that the solution (4.73) is an attractor: in sufficiently wide range of initial data, 
solutions tend to (4.73) at late times; the evolution of the field ọ is basically inde- 
pendent of initial data and is described by the tracker solution. 
The solution (4.73) obeys (we omit the superscript (tr) in what follows) 

£2 1 

p“ ~ V (p) x T: 
It is clear, first, that the regime of evolution of the field is not the slow roll regime. 
Second, energy density p, (see (4.66)) decreases in time slower than the energy 
density of the dominant matter (relativistic or non-relativistic): the latter decreases 
as H? x t~?. The relative contribution of the tracker field into total energy density 
increases in time as t?”. Third, since a x t*, we have 


Pp X ar: 
Qa 


Making use of (3.41) we find the following expression for the equation of state 
parameter wy entering py = Wypy, 


ee 21-v 

Wy =-1+5 : 

e 3 Qa 

This gives, with account of (4.74) and the results of Sec. 3.2.4, 
n 2 


= w— — 4; 
Wy ua E (4.75) 


where w is the parameter of equation of state of the dominant matter (w = ł and 
w = 0 for relativistic and non-relativistic matter, respectively). Thus, equation of 
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state for tracker field depends on the equation of state of dominant matter. This is 
the origin of the term “tracker field”. Note that at large n the equation of state of 
the tracker field is close to that of dominant matter, wy ~ w. 


Problem 4.19. By calculating pressure and energy density according to (4.66), 
derive (4.75) in an alternative way. 


Problem 4.20. Show that solution (4.73) obeys V” x H? at both radiation and 
matter domination. This is another reason to use the term “tracker field” . 


Solution (4.73) is valid at times when energy density of the tracker field is smaller 
than matter energy density. The relative contribution of tracker energy into total 
energy density grows in time, and tracker field starts to dominate at some point. 
After that, the solution (4.73) is no longer valid. Let us find the value of the field 
at the time when it just starts to dominate. At that time 


Vi) n o Ss, 
i.e., 
p? 
Pys p 
Matter energy density at that time is 
pu = SMH G) ~ Y 


Requiring py ~ pm we find that the tracker field starts to dominate when 
p ~ Mp. 


Let us note in parenthesis that the latter condition also implies that the initial value 
of the tracker field may be fairly arbitrary, but should obey y; < Mp). 

At later times the field y(t) increases, and fairly soon the relation (4.71) starts 
to hold. The evolution of y enters the slow roll regime, and the expansion of the 
Universe enters the regime of acceleration. 

Clearly, the cosmological evolution has rather special character at the transition 
period from matter domination to tracker field domination. Therefore, cosmological 
observations may well be capable to confirm or falsify the model. Existing data 
exclude neither cosmological constant nor tracker field; they are consistent with 
many other quintessence models too. 


Problem 4.21. Estimate, for various n, at what values of M the tracker field model 
is consistent with known facts about the accelerated cosmological expansion. 


Chapter 5 


Thermodynamics in 
Expanding Universe 


5.1. Distribution Functions for Bosons and Fermions 


We consider in most Chapters of this book processes that occur in expanding 
Universe filled with various species of interacting particles. As we will see, the rates 
of interactions between these particles are often much higher than the expansion 
rate of the Universe, so the cosmic medium is in thermal equilibrium at any moment 
of time. So, we will need a number of basic relations and formulas from equilibrium 
thermodynamics. We collect them in this Chapter. We note here that as a rule, 
the most interesting periods in the cosmological evolution are those when one or 
another reaction goes out of equilibrium (“freezes out”). The laws of equilibrium 
thermodynamics are still useful in such a situation for semi-quantitative analysis, 
as they enable us to estimate the time of departure from equilibrium and determine 
the direction of inequilibrium processes. 

The thermodynamical description of a system with various particle species is 
made in terms of chemical potential u for each type of particles. If there is a reaction 


Ay + Ag +-+ + An > Bi + Bo+-+++ Bw, (5.1) 


where A;, B; denote types of particles, then the chemical potentials in thermal 
equilibrium! obey 


Hay + Hag to + May = May t+ Mag +7 + Mey (5.2) 


In particular, any reaction involving charged particles may occur with emission of 
a photon (e.g., there is an inelastic scattering reaction e~e~ — e~e~y). Therefore, 


1More precisely, equality (5.2) holds when the system is in chemical equilibrium with respect to 
the reaction (5.1), i.e., when the rate of this reaction is higher than the rate at which external 
parameters evolve (higher than the expansion rate in our case). 
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it follows from Eq. (5.2) that photon has zero chemical potential, 
Hy = 0. 
As another application of (5.2), let us consider the annihilation process 
et te © 24. 
Since uy = 0, Eq. (5.2) gives 
He- + et = 0. 


Clearly, the same relation is valid for any other particles and their antiparticles, since 
they can also annihilate into photons. Thus, chemical potential of an antiparticle 
equals to that of a particle taken with opposite sign. 


A convenient way to deal with all relevant reactions in thermalized system with various 
particle species is to introduce chemical potentials u; to conserved quantum numbers Qe 
only. Q™ should be independent of each other and should form a complete set of conserved 
quantum numbers. The chemical potential for particle of type A is then 


pa =X mY, (5.3) 


where Q? are quantum numbers carried by particle A. Say, at temperatures 200 MeV 
< T < 100 GeV, conserved quantum numbers are baryon number B, lepton numbers Le, 
La, L- and electric charge Q (see Appendix B, color of quarks and gluons is irrelevant 
for us here), i.e., the complete set is Q® = B, Le, Ly, L,,Q. In this case the chemical 
potential of, say, u-quark (baryon number 1/3, electric charge 2/3) is 


tye 
Hu = 3HB + ZQ, 


while for d-quark, electron and electron neutrino we have 


1 1 
Ha = 3HB = HQ, pHe™= Hre HQ, Hve = Hre- 


Relations of the type (5.2) hold automatically for all reactions; an example is a weak 
process 


ute — d+ rie. 


Problem 5.1. Show that the relations (5.2) hold automatically if chemical potentials are 
given by (5.3), and charges Q® are conserved in all reactions (5.1), i.e., 


OP +R =Q9 + Q9. 


Now, the chemical potentials to the charges Q™ can be found if charge densities nou) 
are known. Indeed, the number density of each particle species na is a function of a, so 
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the system of equations 
3 QP na = ngo 


together with (5.3) determines? all p; in terms of ngoi). 


Particle interactions in cosmic plasma are often fairly weak; we will quantify 
this statement in appropriate places. In this case the equilibrium distributions in 
spatial momenta p in locally Lorentz frame are given by distribution functions of 
Bose- and Fermi-gases, 


f(p) = (On)3 Bw) /F FI" (5.4) 


Here 


E(p) = Vp? +m? (5.5) 


is energy of a particle of mass m, T is the temperature. The minus sign in (5.4) 
refers to bosons, plus sign to fermions. 

Sometimes it is possible to disregard +1 in the denominator in (5.4), then the 
distribution has the universal Maxwell—Boltzmann form, 


1 
(27)° 


f(p)= e7 (E@)-H)/T (5.6) 


This form describes, in particular, low density gas of non-relativistic particles, for 
which m > T, (m — u) > T. In the latter case 


f(p) = Gee ‘ig Be (5.7) 


Upon integrating the distribution function over momenta one obtains the num- 
ber density of particle species i, 


= gi | f(p)a°p = Ang: | FE) P? — ne Bd. (5.8) 


When obtaining the second relation, we integrated over angles and used the fact 
that 


EdE = |p|d|p|, (5.9) 


following from (5.5). The factor g; in (5.8) is the number of spin states (degrees of 
freedom). As an example, photons, electrons and positrons have gy = ge- = ge+ = 2, 
for neutrinos and antineutrinos gy = gp = 1, while the overall number of degrees 
of freedom of Wt-, W~-, Z°-bosons and the Higgs boson in the Standard Model 


2It is important here that quantum numbers QO are independent and form a complete set. 
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at T > 100 GeV is equal to 10: massive W+-, W~-, Z°-bosons have 3 polarizations 
each, while the Higgs boson adds one degree of freedom.* 

It follows from the form of the distribution function (5.4) that the difference of 
number densities of particles and their antiparticles depends on chemical potential. 
These differences are typically very small at high enough temperatures as compared 
to number densities themselves. As an example, at temperatures T > 1 GeV the 
asymmetry between quarks and antiquarks is of order 10710; see Sec. 1.5.5. The rel- 
ative difference of electron and positron numbers is of the same order: the Universe 
is electrically neutral, so the net positive charge of quarks is compensated by the 
negative charge of electrons. Hence, chemical potentials are indeed very small in the 
early Universe and can often be neglected. 

Energy density p; for particle species 7 is given by the following integral 


oe f f(p)E(p) dp = 47g: fre f(E) E? — m2 Bab, (5.10) 


where we again integrated over angles and used (5.9). 

To find the expression for pressure, let us consider surface element AS orthogonal 
to the third axis. The number of particles of momenta between p and p+ dp hitting 
this surface during time At is 


An = vz f (p)@pASAt, 


where 


P 
y= T >0 
is the third component of velocity. Upon elastic scattering off the surface, particle 


whose third component of momentum is p, transfers to the surface momentum 
Ap = 2pz. 


Pressure is equal to the ratio of the total transfered momentum to the time At and 
area AS, so we have 


2 
Pi = s f 222 F(p)d°p 
i (5.11) 


_ 4ngi [7 Ipl*dipl p n Argi [> 2 233/2 
-EE BE fp E| AEE -mide 


3We show in Chapter 10 that electroweak symmetry is restored at high temperatures, and W- and 
Z-bosons are massless. So, another way of counting the degrees of freedom is more adequate: two 
degrees of freedom due to each of the W+t-, W7 -, Z°-bosons and four due to the complex Higgs 
doublet. 
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where we made use of the fact that only half of particles move towards the surface, 
and that on average 


due to spatial isotropy. 


Problem 5.2. Derive the expression (5.11) by calculating the energy-momentum 
tensor for gas of non-interacting relativistic particles whose distribution function is 


f(E). 


Let us now study the expressions for number density, energy density and pressure 
in physically interesting limiting cases. We begin with gas of relativistic particles, 


T> Ti; 
at zero chemical potential, 
Hi = 0. 


Expression (5.10) gives then the Stefan-Boltzmann law, 


2 
gi —T* — Bose 


gi E’ 30 
S| yr . 5.12 
P Or? eL/T a l 7 r2 m : , ( ) 
3h 30. T ermi 


The calculation of the integrals used in this Chapter is given, e.g., in the book 
[78]; we collect the relevant formulas in the end of this Section. As one should 
have expected on dimensional grounds, the energy density is equal to Tt up to a 
numerical coefficient; the contribution of fermions differs from that of bosons by a 
factor 7/8. Note that relativistic particles are conveniently characterized by their 
helicity — spin projection onto the momentum direction; the parameter g; is then 
the number of helicity states. 

If there are various relativistic particles of one and the same temperature T, and 
chemical potentials are negligible, the energy density of the relativistic component is 

2 


T 
P= 9T", (5.13) 


where 


a= X +e Y Gi (5.14) 


bosons fermions 
with m &« T with m &« T 


is the effective number of degrees of freedom. 
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The expression (5.11) for pressure has a simple form in the relativistic case, 


i E’ i 
I dE = Ë. (5.15) 


Pi = Gq2 J cEIT +1 3 


Thus, equation of state of relativistic matter is 


1 
P= zp 


This implies, in particular, that the sound speed squared is 


2 Op 1 
u = => =. 
Op 3 
Problem 5.3. Show that the relation p = p/3 is valid for gas of relativistic particles 
which is not necessarily in thermal equilibrium. 


Finally, the expression (5.8) for number density gives in the relativistic case 


gr — Bose (5.16a) 

gi E? è g2 
raer — Fermi (5.16b) 

T 


Numerical value for zeta-function here is 


Making use of (5.12), (5.16) we find the average energy per particle, 


? 


pi 2.70 T — Bose 
(5.17) 


(BE) =—® E 
3.15 T — Fermi 


As a simple example of application of these formulas, let us estimate the max- 
imum temperature at which relativistic cosmic matter is in thermal equilibrium 
with respect to electromagnetic interactions. We consider here the high temperature 
case, T >> 1 MeV, when electrons are relativistic. Relevant processes are Compton 
scattering, ee~ -annihilation (Fig. 5.1), etc. Amplitudes of these processes are pro- 
portional to a = e?/(4r), and the cross-sections to a?. Together with dimensional 


Y Y 


H 


Fig. 5.1. Feynman diagrams for Compton scattering and e+e~-annihilation. 
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arguments, this gives for the rate (inverse of time between scatterings of a single 
particle with other particles in the medium) 


T~ T. 
The processes are in thermal equilibrium if this rate exceeds the cosmological expan- 
sion rate, 
r > H(T). 

We know from Sec. 3.2.2 that 

T? 

H(T) = —. 
Mp, 


Hence, the cosmic plasma is in thermal equilibrium at 
T K @? Mý, ~ 1014 GeV. 


Very similar but not exactly the same estimate is valid for weak and strong (QCD) 
interactions. Thus, the cosmic medium is indeed in thermal equilibrium during long 
epoch of the cosmological evolution. As we mentioned above, most interesting are 
in fact inequilibrium phenomena; we discuss many of them in this book. 

Let us now consider the dilute gas of non-relativistic particles, whose distribution 
function is given by (5.6). In this case the expression for particle number density is 


Ni = Gi (mE (5.18) 
while energy and pressure are 
pi = Mini + Sil (5.19) 
and 
pi = TN; & pi. (5.20) 


As anticipated, non-relativistic gas has equation of state p = 0 up to corrections of 
order O(T/m). 


Problem 5.4. Show that at temperature exceeding the mass and chemical potential, 
T > m, pu, the difference of the number densities of particles and their antiparticles 
of a given helicity is 


T2 
An = Hy- Bose (5.21) 


and 


T? 
An = HT Fermi. (5.22) 
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Hints: (1) Recall that chemical potentials of particles and their antiparticles differ 
by sign only; (2) Take linear in u term in the integral (5.8) with m = 0, integrate 
by parts and use formulas presented in the end of this Section. 


Problem 5.5. Find the differences of the number densities of all relativistic Stan- 
dard Model particles and their antiparticles in the cosmic medium at temperature 
T = 400 MeV for given densities of baryon and lepton numbers ng, nL., NL, NL, in 
the realistic case ng,np.,NL,,NL, K TÌ. Hints: (1) At this temperature, relativis- 
tic are u-, d-, s-quarks, gluons, photons, electrons, muons and all neutrino species; 
other Standard Model particles are non-relativistic; (2) neutrinos have one helicity, 
while other fermions have two; (3) quarks have three color states, gluons have eight. 


Let us give numerical values of useful integrals. 


1. 


2. For positive integer n 


e7 — 1 4 
oo „2n—1 2n—1 _ 
f = g= 2 1 m Ba, 
o e4+1 2n 
where Bn are Bernoulli numbers, 
il il 1 
Bı =-=, B2 = —, B=— 
al 6 ? 2 30 > 3 42 ’ 


3. For arbitrary x 


fb Rae = 6-2 rece), 


where ¢(x) is the Riemann zeta-function, some of whose values are 
¢(3) = 1.202, ¢(5) = 1.037, ¢(3/2) = 2.612, ¢(5/2) = 1.341. 
Recall that for positive integer n 
T(n) = (n-— 1)! 


The values of gamma-function I(x) at half-integer x can be found by making use of 
the value 


(1/2) = VF 


and the relation [(1 + x) = zr (x). 


5.2. Entropy in Expanding Universe. Baryon-to-Photon Ratio 109 


5.2. Entropy in Expanding Universe. Baryon-to-Photon Ratio 


One of the main thermodynamical characteristics of a system is its entropy. Hence, 
it is useful to discuss entropy of cosmic medium in the expanding Universe. Recall 
that entropy in thermodynamics enters the first law. In the general case of variable 
number of particles the latter has the form 


dE = TdS — pdV + Sidi, (5.23) 


where S is entropy of the system and subscript 7 labels particle species. This 
subscript and the corresponding summation will be omitted wherever possible. 

Energy E and number of particles are extensive quantities (proportional to 
volume of the system), while temperature and pressure are local characteristics 
independent of the volume. In accordance to the first law (5.23) entropy is extensive 
quantity. It is convenient to introduce densities 


E N S 
== =— = >. 5.24 
eap MET, $y (5.24) 
Then the first law (5.23) is written as follows: 
(Ts —p-— p+ un)dV + (Tds — dp + udn) V = 0. (5.25) 


This relation is valid for both the entire system and any of its part. Using it for a 
region of constant volume inside the system we obtain 


Tds = dp — udn. 
We now use (5.25) for the entire system and find for entropy density 


s- Pte aH 
a 


An important example is relativistic matter at vanishing chemical potentials, for 
which 


pte 
s T (5.26) 


Making use of the latter relation and expressions of the previous Section for p and 
p we obtain the following expression for the contribution of ith particle species, 


aa ee . (5.27) 


By comparing these expressions to formulas (5.16) for number density, we see that 
entropy and number of particles differ in relativistic case by a numerical factor of 
order 1 only. 
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To obtain the expression for entropy in non-relativistic case, we make use of 
(5.19) and (5.20) and write 


5 Mi — hi 
si = zi t — p 


The chemical potential is related to the number density by Eq. (5.18), so we find 


finally 
3/2 
Ji miT / 
ni \ 2r ` 
Number density of non-relativistic particles is always small in cosmic medium; 
at temperatures T < 100MeV, when protons and neutrons are non-relativistic, 


Ny. 


their number density is estimated as ng ~ 10~%ny; see (1.21). The same estimate 
applies to electrons which are non-relativistic at T < 0.5 MeV. Thus, non-relativistic 
contribution to entropy is negligible, so the total entropy is given by 


s = ge — T’, (5.28) 


where the effective number of relativistic degrees of freedom is defined by (5.14). 
One of the key properties of entropy making it a very useful quantity is the 
second law of thermodynamics. According to this law, entropy of any closed system 
can only increase, and it stays constant for equilibrium evolution, i.e., slow evolution 
during which the system always remains in thermal equilibrium. Let us see how this 
law works in expanding Universe, assuming that the evolution of cosmic medium 
is close to equilibrium. To this end, we come back to the relation (5.23) and recall 
that it is adequate to introduce chemical potentials to conserved quantum numbers, 
rather than to individual particle species; thus, dN; in (5.23) should be understood 
as differentials of conserved quantum numbers. With this qualification, we apply the 
relation (5.23) to the comoving volume, V = a. Then the conservation of quantum 
numbers in comoving volume gives dN; = 0, so that 
ds d(a3s) 


T— =T 
dt dt 


= (p + p)dV + Vdp =a? [o+ o3 +l = 0; (5.29) 


where we used the covariant conservation of energy in expanding Universe, 
Eq. (3.13). We see that total entropy in comoving volume is conserved, 


sa? = const. (5.30) 

As we have already mentioned in Sec. 3.1, the covariant conservation of energy has 
simple interpretation in terms of the entropy conservation in comoving volume. 

Entropy conservation is used for introducing quantitative, time-independent 

characteristics of asymmetries of conserved quantum numbers. In particular, as 

we discussed in Chapter 1, there is baryon asymmetry in the Universe, as there are 

protons and neutrons but none of their antiparticles today. At temperatures below 
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T ~ 100 GeV, there are no processes violating baryon number, so it is conserved in 
comoving volume,* 


3 = const, (5.31) 


(ns — nps)a 
where npg and ng are number densities of baryons and antibaryons. By comparing 
(5.30) and (5.31) we see that ratio 


ipa e (5.32) 


is a time-independent characteristic of the baryon asymmetry. 

Let us give an estimate for the baryon asymmetry in the early Universe. When 
discussing the Universe at relatively low temperatures (T < 100keV), one tradi- 
tionally uses baryon-to-photon ratio 


ng 
Np = m 


ny’ 
where n, is the photon number density. At these temperatures 7, is independent of 
time, and it differs from A, by a numerical factor of order 1. The latter factor is due 
to both the difference between n, and photon entropy sy (see (5.16) and (5.28)), 
and the neutrino contribution to entropy (see Chapter 7). At T < 100keV (but, 
strictly speaking, at temperatures exceeding the neutrino masses) entropy density 
is 


aS gar, 5.33 
S 45 9*0 ( ) 
where 
7 4 43 
E ER EEREN On pee ena 
A TT 


is the effective number of relativistic degrees of freedom after neutrino decoupling." 
The first and second terms here are due to photons and neutrinos, respectively 
(effective temperature of neutrinos is related to photon temperature T, = T by 
(4.14); the origin of the factor z -2-3 is the same as in (4.15)). 

Photons and neutrinos are non-interacting in the present Universe, and neutrinos 
have masses. Therefore, the formula (5.33) is not valid at late times. However, the 
combination aT remains constant starting from temperatures of order 100 keV 
(below electron mass) and up until now. Therefore, the relation (5.30) is valid for 
late Universe, if one defines the quantity s by Eq. (5.33) for all temperatures below 


4In some scenarios, baryon asymmetry is generated at temperatures below 100 GeV, see, e.g., 
Sec. 11.6. We do not consider this possibility here. 
5Note the difference between the parameters gx,9 and g«, the latter entering (4.22). 
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100 keV. With this definition, the present value of entropy density is 


2 2 
so = SE oT = 2.9 -103 cm3. (5.34) 
Thus, we have 
ñ 2¢(3) 
Az = — ns = Ta Ne = 0.14 Hp. (5.35) 
5 15 9 0 


The value of baryon-to-photon ratio is obtained from studies of BBN and CMB. 
These two independent methods agree with each other and give [1] 


ns = (6.05 + 0.07) - 107. (5.36) 


In terms of the baryon asymmetry 
A, = 0.86- 107". 


As we discussed in the previous Section, this means, in particular, that baryon 
chemical potential is very small at T > 200 MeV. 


Problem 5.6. Estimate the value of the chemical potential for u-quark at temper- 
ature 1 GeV. 


To conclude this Section we make two comments. One is that the baryon-to-photon 
ratio is directly related to the relative energy density of baryons Qs = ps/pe = 
MpNg/pe, see Chapter 4. The present number of photons calculated according to 
(5.16a) is 


(3) 


ny = 2> To = 411cm”. 


Making use of the value (1.14) of critical density, we find from (1.21) 


Oph? = 0.0221 + 0.0003 , (5.37) 
and hence Q, ~ 0.05, the value quoted in Sec. 4.1. 
Problem 5.7. Estimate the contribution of electrons and protons into total energy 
at T = 50keV and T = 1eV. Hint: Recall that electrons and protons are non- 


relativistic at these temperatures. Neglect the presence of light nuclei in the plasma. 


The second comment concerns the baryon-electron—photon plasma before 
recombination. As we discuss in Section 6.3, baryons in it are in thermal equilibrium 
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with photons. The sound speed squared in this plasma is 


dp dp 4p 1 
w= — 7 = ——1_ = 5.38 
dp dp,+dpg 4p,+3pp 3(1+ Rp) (5.38) 


where we use py = p4/3 « Tt, pg x T’ and introduce the time-dependent para- 
meter 


At the recombination epoch® one has T, = 2970 K and 


3 _ MpNBNY 


Rs = 5 -0.75 ~ 0.46. (5.39) 


Py 


It follows from (5.38) and (5.39) that the sound speed before recombination is 
somewhat different from the relativistic value us = 1/ v3. This fact is, of course, 
accounted for in the data analyis, see Section 1.7.1 in this regard. 


Problem 5.8. Estimate the contributions of electrons and protons to entropy den- 
sity in the Universe at 100keV > T > 1eV. Hint: at these temperatures electrons 
and protons are non-relativistic and do not combine into atoms (recombination has 
not happened yet); the number densities of electrons and protons are equal; neglect 
the helium and other nuclei and atomes. 


5.3. *Models with Intermediate Matter Dominated Stage: Entropy 
Generation 


Let us consider a possibility that at some early epoch, before relatively late radia- 
tion domination, the expansion of the Universe was matter dominated. There are at 
least two scenarios for how that can happen. One of them assumes that there exist 
new heavy particles G in Nature whose lifetime 7 is large enough, and that in the 
beginning of the Hot Big Bang epoch the expansion is radiation dominated, but the 
cosmic medium contains an admixture of G-particles. G-particles are non-relativistic 
at temperature below their mass. As the Universe expands, energy density of the 
relativistic component decreases as a~*, while energy density of non-relativistic 
G-particles decreases as a~°. Therefore, at a certain moment of time t, the expan- 
sion becomes dominated by non-relativistic G-particles, and the intermediate matter 


6We take into account the fact that recombination of helium occurs at much higher temperature, 
so helium atoms decouple from photons and do not contribute to Rg. The ratio of the number 
of baryons in helium to the total number of baryons (helium mass fraction) is about 25%, see 
Chapter 8, hence the factor 0.75 in (5.39). 
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dominated stage begins. The main assumption here is that 
ta ae (5.40) 


where T is the total decay width of G-particle. During later evolution of the Universe, 
an important process is G-particle decay into light particles, say, quarks, leptons 
and other Standard Model particles. These rapidly (faster than in one Hubble time) 
thermalize and become part of the relativistic component. We will see that the 
epoch of G-particle domination ends up soon after the expansion rate slows down 
to H ~T: at about that time, all G-particles decay away, the Universe becomes 
filled with hot relativistic matter and the radiation domination resumes. We note 
that thermal equilibrium is strongly violated in the time interval tą < t <I~!: the 
concentration of G-particles is very far from equilibrium. This property is of course 
a consequence of (5.40); it gives rise to strong entropy generation in the Universe. 

This scenario is realized in some models with heavy gravitino, see Sec. 9.6.3, 
hence our notation for the new heavy particles. Another example is unstable sterile 
neutrino. 

The second scenario makes use of the observation that the role of non-relativistic 
matter in cosmology may be played by massive spatially homogeneous scalar field 
oscillating near the minimum of its scalar potential, see Sec. 4.8.1. As we discussed 
in Sec. 4.8.1, this field can be viewed as a collection of large number of particles 
at rest. It is assumed that before the Hot Big Bang epoch, all energy is due to 
this field, and its oscillations begin at time t, (before that the field evolves in the 
slow roll regime, see Sec. 4.8.1). Hence, there is no relativistic matter at all at time 
tą. One assumes further that the decay width of the scalar particles again obeys 
t. < T71, and that these decays is the main mechanism of the decay of the scalar 
field oscillations. In fact, the latter assumption is pretty strong: there are alternative 
mechanisms leading to the decay of coherent oscillations of scalar fields, which are 
often more efficient. We discuss some of these mechanisms in the accompanying 
book, and here we will proceed under the above assumption. Then, like in the first 
scenario, the Universe becomes radiation dominated when H(t) ~T. 

The analysis of the two scenarios goes in parallel up to some point. Let pm and 
Prada be energy densities of non-relativistic and relativistic component, respectively. 
Let us assume for simplicity that the effective number of relativistic degrees of 
freedom g, does not change in time (the opposite case is the subject of the problem 
in the end of this Section). The number of heavy particles in comoving volume 
(nma?) decreases only due to their decays, hence 

d 3 


Gina) = -Inma . 


Since pm X nm, we find 


PM + 3Hpm = —Tpm. (5.41) 
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Energy density of relativistic particles decreases due to the cosmological expansion, 
since their number density gets diluted and energy of each particle gets redshifted. 
However, energy is injected into this component due to decays of heavy particles. 
Hence, equation for praa has the form 


braa + 4 praa = Vp. (5.42) 


The third equation that closes the system of evolution equations is the Friedmann 
equation 


87 
H? = 3 Glem + Prad): (5.43) 
Our purpose is to study the solutions to the system (5.41)—(5.43). 


Problem 5.9. Making use of Eqs. (5.41) and (5.42) show that the total energy and 
pressure (sums of energies and pressures of the two components) obey the covariant 
conservation equation Pitot = —3H (prot + Ptot)- 


In the first place, the solution to (5.41) is 


const _ r 
t) = = i 5.44 
pm (t) aT) (5.44) 


At both radiation and matter domination one has t ~ H~!, so the energy density of 
non-relativistic component is small at late times, when T >> H(t). In this regime the 
right-hand side of Eq. (5.42) can be neglected, and the energy density of relativistic 
matter falls as a~*. This is power-law rather than exponential decrease, hence at 
T >> H(t) one has praa > pm. Thus, the expansion is indeed radiation dominated 
at late times. The time of transition from matter domination tmp—prp is roughly 
estimated as 


T ~ H(tmp—rp).- (5.45) 


The above reasoning does not exclude the possibility that the right-hand side of 
(5.45) contains a logarithmically large factor, so that the transition occurs somewhat 
later; neither it excludes earlier transition, at time when T < H. We will see that 
neither of this possibilities is realized. 

Let us consider solutions to Eqs. (5.41)—(5.43) after the beginning of matter 
dominated stage, t > tą, but at sufficiently early times when T < H(t). In other 
words, we are interested in the time interval 


teL trh 


Let us assume that the expansion is matter dominated in the whole interval; we will 
justify this assumption a posteriori. At these times the exponential factor in (5.44) 
is nearly equal to 1, i.e., the loss of particles due to their decays is negligible. Hence, 
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the evolution of the Universe is described by formulas of Sec. 3.2.1; in particular, 
a x t?/3 and 


fad ME a E. 
“ap PM- gna”  GrGe 


By substituting the latter expressions into (5.42), we obtain the equation 


. T 
Prad + 3p Pred = GE 
Its general solution is 
r 1 C 
Tad S ano TR 73 0) A 
Prod = rat BE da 


where C is an arbitrary constant. This constant is determined by initial data and 
it is different in the two scenarios outlined above. 

Let us begin with the first of them. We will see that at the time tą of the 
transition to matter domination, the first term in (5.46) is small, and the property 
that Praalti) = pm (ts) gives 


Hence, the behavior of praa at t > ty is 


T 1 1 ¢F 


IG t t GrG Er T Poen + Pinit- (=ar 


Prad = 
The first term here is due to injection of energy through decays of heavy particles, 
while the second is the energy density of the relativistic matter existing in the 
Universe initially. Note that the first term decreases in time relatively slowly, while 
the second term decays as a~*, as should be the case. Our basic assumption T « ty! 
implies that the first, induced term is indeed small at t = tx; it starts to dominate at 
t ~ t 5T-3/5 < T71, i.e., long before the transition back to radiation domination. 
At the latter time, efficient generation of entropy begins. 

Let us point out that praa, and hence the temperature of relativistic component, 
monotonically decreases in time. This may be a surprise: one might naively expect 
that maximum temperature in the Universe is reached at H(t) ~ T, when most of 
heavy particles decay. The above analysis shows that the smallness of the relative 
number of heavy particle decays at early times is compensated for by high number 
density of these particles, so the contribution of the decays into praa is not small at 
small t. 


Problem 5.10. Show that temperature of the hot component monotonically 
decreases at H(t) <T as well. 
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In the second scenario, Praa = 0 at t = tx, which gives 


r f #2 
Prad = ME G = S=) . (5.48) 


Energy density of hot matter rapidly (during time of order tą) increases from zero 
to its maximum value” 


I 
Prad,maz ~ 107Gt,.’ 


and then decreases as t-!. The maximum temperature is reached soon after the 
beginning of matter dominated stage. 

In both scenarios one has praa < pm at times t < rt. The energy density of 
hot component approaches that of heavy particles as t approaches typ—+rp ~ T7}, 
i.e., the time when H ~T. This means that the relation (5.45) is valid without large 
logarithms. Right after the transition to radiation domination one can use formulas 
of Sec. 3.2.2 for estimates, so the relation (5.45) can be written as 


Pa TMD—RD 
Mp, 
It is known from the analysis of Big Bang Nucleosynthesis (see Chapter 8) that the 
expansion was radiation dominated at temperatures of at least Tggy ~ 1 MeV, so 
there is a bound Tyyp-.rp > Tggyn. This bound gives the bound on the lifetime of 
heavy particles, 
-1 < Mpr(Tegn) 


~N 


4 
II 
= 


~Ils. A 
Thx s (5.49) 
This bound is important, e.g., for some models with heavy gravitino, see Sec. 9.6.3. 
Decays of heavy particles is a strong source of entropy in the above scenarios. 
In the first of them, entropy density at the time of the transition back to radiation 
domination is mostly due to these decays, and its value iS Sgen ~ pola (t =F"), 
If there were no heavy particles, the entropy density would be equal to Sinit ~ 


roe (t =[—'), see (5.47). This gives the entropy growth factor 


Sgen 1 


~N 


Sinit Tt, 


Clearly, this factor is large for tą < I~! i.e., entropy generation is very effective. 
In the second scenario, there is no entropy in the beginning at all, so all entropy 
comes from decay processes. 


7Since the reheating is fast, the approximation of instantaneous beginning of matter dominated 
stage is not adequate. For this reason, the numerical coefficient in (5.48) depends on the details of 
evolution at the beginning of that stage. 
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Problem 5.11. Generalize the analysis of this Section to the case when effec- 
tive number of relativistic degrees of freedom g, changes in time. Hint: Instead of 
Eq. (5.42), derive and use similar equation for entropy of the hot component. 


5.4. *Inequilibrium Processes 


We often encounter in this book a situation when most processes in cosmic medium 
are fast, but there is one or several slow processes whose low rate keeps the medium 
out of complete thermal equilibrium. 

Let us give a typical (but by no means unique) example of such a situation. 
Let there exist new long living heavy particle X (say, of mass mx 2 100GeV), 
and the only processes by which the number of X-particles and their antiparticles 
X in comoving volume can change are their pair creation and annihilation. Let 
us assume that scattering of X-particles off the Standard Model particles (quarks, 
leptons, photons, etc.) occurs at sufficiently high rate. At temperature T exceeding 
the mass of X-particles mx, creation and annihilation of X-particles are typically 
also rapid, so they have equilibrium abundance. Thus, the system is in complete 
thermal equilibrium at T Z myx. Let us consider symmetric medium with equal 
number of particles X and their antiparticles X. At T < mx, the number density 
of X-particles exponentially decays as temperature decreases, and the annihila- 
tion of X-particles occurs less and less often simply because the probability for an 
X-particle to meet its antiparticle becomes smaller and smaller. This is an exam- 
ple of the situation we are going to consider: at T < myx all processes except for 
annihilation and creation of X-X-pairs are fast, while the latter processes are slow, 
and the abundances nx and ng = nx may differ from the equilibrium abundance. 
In this case the medium still has well-defined temperature and numbers of particles 
(including numbers of X and X themselves). The distribution functions are given 
by formulas of Sec. 5.1; the fact that the system is out of thermal equilibrium is 
reflected by the property that wx Æ 0, and wx = ux for symmetric matter (recall 
that x = 0 for symmetric matter in equilibrium). 

Leaving aside for a while the cosmological expansion, let us discuss how the 
system relaxes to thermal equilibrium in these cases. In general, as the system 
relaxes, its effective temperature and number densities of all particles evolve in time. 
Temperature changes, if the relaxation occurs with heat release; this is the case in 
our example if the X-particle abundance is higher than in equilibrium, nx > n, 
so the relaxation proceeds via annihilation (rather than creation) of X-X-pairs. 
Still, at any moment of time, temperature and chemical potentials take well-defined 
values. So, at any time the system has a certain free energy F(T, N;).° Let us recall 


8It is often more convenient to use Grand potential rather than free energy, considering chemical 
potentials rather than particle numbers as the thermodynamical parameters. In this Section, how- 
ever, we will discuss states with fixed number of particles. The formula (5.50) is valid precisely for 
these states. 
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that free energy is related to the number of states (partition function) by 
Z = eT, (5.50) 


where we assume that the system has finite, albeit large, volume. As the system 
approaches thermal equilibrium, free energy tends to its minimum, and partition 
function to its maximum. 

The system relaxes to thermal equilibrium due to “direct” slow process (in our 
example this is X-X-annihilation; we assume for definiteness that nx > n$). Let 
us denote the rate of these processes (number of their occurrences per unit time per 
unit volume) by [4.9 There are of course inverse processes whose rate is denoted 
by I'_ (these are processes of X-X-pair creation in our example). Let us find the 
relation between [4 and [_. 

As a result of a single direct process the system makes a transition from a state 


with smaller partition function Z_, and hence larger free energy F_, to the state 
with larger partition function Z} and smaller free energy F}. Let i— be one of 
microscopic states!? before the direct process, and j} be one of microscopic states 
that can appear after the direct process. The probability that the system is in the 
initial state i_ is 


e-Ei-)/T 
P(i_) = ——— 
i- =, 
where E(i_) is the energy of the state i_ (It is important here that the system is in 
thermal equilibrium with respect to all processes except for the one we consider.) Let 
y(i_ — j+) be the probability of the process i_ — j+. Then the total probability 
of the direct process in the system is 


. : 1 -Eli ; . 
r= >) Pe) Sp a 
i— j+ ~ 4454 
Similarly, the total probability of an inverse process is 
1 


7 5 eT BU+)/P (j — i_). 
Ly. 


i—,j+ 


Tr- 


The probabilities of the process i_ — j+ and the inverse process j} — i— are equal 
to each other, y(j+ —> i_) = yli- — j+). Making use of this fact, and also the 


9The notation has to do with the fact that the direct process increases the value of the partition 
function; this process is thermodynamically favored. 

10As an example, in classical mechanics this is a state in which position and velocity of every 
particle are well-defined. 
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energy conservation, E(i_) = E(j+), we obtain 


eo es te ye 
TZ i 


and finally 


a eSEE. (5.51) 
T_ 

where AF is the difference of free energies after and before the direct process; 
according to our definitions, AF < 0. In what follows, Tr} and F denote the 
number of direct and inverse processes occurring per unit time per unit volume. 
Clearly, the formula (5.51) is valid for these quantities too. The relation (5.51) is 
known as the general formula of detailed balance. 

In our example, we have 


hie E a 


——ANx 
ON x ON<x a 


where AN, = AN; = —1 is the change in the number of X- and X-particles in a 
single direct process (annihilation). Hence, 


AF = —2y1x, (5.52) 


where we made use of the facts that OF/ONx = ux and px = Hx. 

The relation (5.51) shows, in the first place, that free energy indeed decreases 
during relaxation: Ty > T_ for AF < 0. Now, if |AF| > T then Ty >T., so 
the inverse processes are negligible. Conversely, if |[AF| « T then the deviation 
from thermal equilibrium is small, (T4 —T_) < T+, so direct and inverse processes 
occur at almost equal rates. In the latter case, the rates are approximately the same 
as in equilibrium r4 ~ Ir- ~ T°, so the relaxation rate is given by 


AF 


In the situations we discuss, it is often possible to introduce local (independent 
of volume V) parameter characterizing the medium, n = N/V. In our example this 
is the number density of X-particles, while N is the total number of X-particles in 
the system. The quantity N does not change in fast processes and changes by AN 
and (—AN) in each direct and inverse process. Still leaving aside the expansion of 
the Universe, let us write down the equation for n(t), 


dn 
dt 


(recall that [+ are the rates per unit volume). The free energy as function of N is 
at minimum in thermal equilibrium, so 


(Ary — e 


=AN. (T4 -T_) (5.54) 


eq 
— -AN =0. 
ay) AN=0 
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Near thermal equilibrium, AF is proportional to the deviation of n from its equi- 
librium value n11, hence 


AF = —a (n= n°1), 


where a is a positive parameter. Equation (5.54) then takes the form 


dn a 
— = ANT" (n—n%). 5.55 
Te = GANT (n — n°) (5.55) 
Since n does not depend on time in thermal equilibrium, 
dna 
d 


Equation (5.55) can be written as 
d(n— n°?) 
dt 


It is now clear that the relaxation to thermal equilibrium has exponential charac- 
ter. 


In our example of symmetric medium, the abundance of particles X and X at 


= FANT" (n — n°3). (5.56) 


T < mx is given by (5.18) with non-vanishing ux = wx, hence 
nx = ny = #*/T ne, 


Thus, near thermal equilibrium, i.e., at £% « 1, Eq. (5.52) gives 


eq 
Ny —7n 
AF = —2ux = —2T——,— 
Nx 
We see that a = — and, since AN = —1, the relaxation to thermal equilibrium is 
x 
described by the equation 
dinz- na pae e 
X 


Finally, let us note that the equilibrium rate of annihilation processes per unit 
volume is 


T4 = PS4 . n°, 
where Ix eg = Tx! is the lifetime of X-particle in the equilibrated medium. We get 
= -0% (nx — n48), 

which is the Boltzmann equation for the number density in our example. 


11Note that the right-hand side of Eq. (5.56) is always negative: for n > n°? the direct process 
decreases n, i.e., AN < 0, and vice versa. 
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The latter result can be obtained also in an elementary way. Let nx and nx be the 
number densities of particles and antiparticles. Then the probability of annihilation of a 
given particle X with some antiparticle X per unit time is 


Tx = Oan ng: Ux, 


where vx is X-particle velocity, and Gann is the annihilation cross-section (during time 
Tx =x! a particle of cross-section Cann meets precisely one antiparticle X). Hence, the 
annihilation probability per unit time per unit volume is 


DST ens Seu ie ing ST a, (5.58) 


The probability of the inverse process of X-X-pair creation is independent of the actual 
number of X-particles in the medium, so it is given by 


Pas Rae 
Hence 
d (nx — næ) eq NXNx 
a e 
xX “X 
In symmetric medium, ng = nx and nẸ = n%, so for |nx — n&| < ng we recover 
Eq. (5.57). 


Equation (5.54) is no longer valid in expanding Universe. As the Universe 
expands, the parameters like number density decrease as a7’ 
are switched off. So, it makes sense to consider the quantity N in comoving volume, 
N œ na®. Instead of (5.54), we now have the following equation, 

d (na?) 
dt 


where [, and [_ are still the rates of slow processes per unit physical volume, 


even if slow processes 


=AN:- (Fr. —T_)+a', (5.59) 


which in general obey the relation (5.51). Clearly, T} and [_ depend on time, 
since temperature and number densities decrease in time. Near thermal equilibrium, 
Eq. (5.55) is generalized to 


d(na®) a 
Nee, . peq = 09) cae 
ii TAN T°? (n — n°1) +a’, (5.60) 


Equation (5.59) is the Boltzmann equation for the number density in expanding 
Universe. We will repeatedly encounter equations like this in the course of our 
study. 


Chapter 6 


Recombination 


6.1. Recombination Temperature in Equilibrium Approximation 


At temperatures exceeding the binding energy of electrons in atoms, the mat- 
ter in the Universe was in the phase of plasma consisting of electrons, pho- 
tons and baryons. As we discuss in Chapter 8, at temperatures below a few 
dozen keV the baryon component was predominantly protons (about 75% of total 
mass) and *He nuclei (about 25% of total mass). As the Universe cools down, it 
becomes thermodynamically favorable for baryons and electrons to combine into 
atoms. This cosmological epoch is called recombination. Before recombination, the 
plasma was opaque because of photon scattering off free electrons, while after 
recombination relic photons propagate freely. These are the CMB photons we see 
today. 

We are interested in recombination of hydrogen, since recombination of helium 
occurs earlier and has very little effect on properties of photon—electron—proton 
plasma afterwards. The recombination of helium is the subject of Problem 6.5. 

We begin our analysis by determining the temperature at which formation of 
atoms becomes thermodynamically favorable. We will assume temporarily that the 
expansion of the Universe is adiabatically slow. Under this assumption, the medium 
is always in thermal equilibrium. In particular, there is chemical equilibrium between 
its components. This is not an adequate approximation: recombination is not an 
equilibrium process [79, 80], it is slightly delayed. We consider inequilibrium recom- 
bination in Sec. 6.2, and here we restrict ourselves to the complete thermal equi- 
librium. In this way we will understand the relevant range of temperatures and 
cosmological times. We will also obtain the equation of chemical equilibrium (Saha 
equation) which is used in this and other Chapters. We stress again that one should 
bear in mind that the picture we consider in this Section is not directly relevant for 
recombination in the real Universe. 

One could naively expect that the temperature of equilibrium recombination is 
similar to the binding energy of electron in hydrogen atom. This is not the case, 
however: the recombination occurs at much lower temperature. The physical reason 
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is as follows. At low electron and proton densities, the recombination of a given 
electron with some proton occurs in time! 7, inversely proportional to the number 
density of protons T} œ n;!. The newly formed hydrogen atom is ionized by a 
photon whose energy exceeds the binding energy of hydrogen atom, Ap. There 
exist thermal photons of these energies even at T < Ap, though their number is 
exponentially small. Hence, the lifetime of a hydrogen atom in medium is finite, 
albeit exponentially long, r- œ e4“/7, Recombination becomes effective when 


Te ~ To, (6.1) 


which indeed gives T « Ap for small ng (and small electron velocities which enter 
7). One could find the equilibrium recombination temperature by elaborating on 
this kinetic approach (see Problem 6.4). It is simpler, however, to make use of 
thermodynamical approach which we now present. 

We will see that the temperatures of interest are about 0.3eV. At these temper- 
atures and in chemical equilibrium, electrons and protons are non-relativistic, while 
most hydrogen atoms are in the ground state (see Problem 6.6). The equilibrium 
number densities of electrons, protons and hydrogen atoms in the ground state are 


T\3/? 
3/2 
MT = 
tp = a» (SE) gltp— mp) /T. (6.3) 
p 3/2 


These formulas involve yet unknown chemical potentials. The number of spin states 
of proton and electron equals ge = gp = 2, while for hydrogen atom gy, = 4. 


Problem 6.1. Show that gy = 4. 
The equilibrium recombination temperature T°? is determined from the relation 
npl Te] ~ ny T (6.5) 


To find T$41, we need three more equations that complete the system of equations on 
number densities and chemical potentials at given temperature. One is the baryon 


'Superscripts + and — refer to direct and inverse processes — recombination and ionization, 
respectively. 
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number conservation, 
Np + Ny = Np. (6.6) 
Here and in the next Section (and only in these Sections) ns denotes the baryon 


number density without helium (for total baryon number density we use the nota- 


a tot 
tion nt), 


np(T) = sny(T), te = 0.75Ns. (6.7) 
The baryon-to-photon ratio is given by (1.21), so that 
fie = 4.5 - 1071, (6.8) 


while the number density of photons n,(T) is a known function of temperature; see 
(5.16a). Another equation, 


Hp + He = Hn, (6.9) 
follows from the assumption of thermal equilibrium for the reaction 
pteoH+y7. (6.10) 
The last equation expresses electric neutrality of the Universe, 
Np = Ne. (6.11) 


Thus, we have six equations, (6.2), (6.3), (6.4), (6.6), (6.9) and (6.11), determining 
three number densities ne, np, Ny and three chemical potentials pe, Hp, Hy at given 
temperature T. The quickest way to simplify this system of equations is to multiply 
Eqs. (6.2) and (6.3): 


3/2 3/2 
= MpL MeT (Mp tHe -~Mp—Me)/T 
ee YpYe 2T 2T Š ` 


Now, using (6.4), (6.9) and (6.11) we get rid of chemical potentials and write the 
latter equation in the form 


MeT 3/2 
b= (SE) mean, (6.12) 
where 


An = Mp + Me — My = 13.6 eV 


is the binding energy of hydrogen atom, and we neglected the difference between mp 
and m, in the pre-exponential factor in (6.12). Together with Eq. (6.6), equation 
(6.12) determines the equilibrium proton and hydrogen densities np and ny. 
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It is convenient to introduce dimensionless ratios 


Then the baryon number conservation equation is 
XptXy=1. (6.13) 


Using Eq. (6.12) we express X,, through X,, and then Eq. (6.13) becomes the 
equation for the only unknown Xp, 


a 3/2 ay 
Xp +ns X; (=) eT =1. (6.14) 
Equation (6.14) (as well as Eq. (6.12)) is known as the Saha equation. It is 
convenient to express nz through the baryon-to-photon ratio Ĥs by making use 
of (6.7) and then use the formula (5.16a) for photon number density n,(T’). This 
leads to the equation containing dimensionless quantities only, 


2 anT \ 
pore (=) Xe F = 1, (6.15) 


T2 z 


The second term here is the equilibrium number density of hydrogen atoms 
expressed in terms of Xp, 


2 2T \ z 
Xu = Phi ( 7 ) Xe, (6.16) 


Me P 


We now see explicitly that recombination occurs at temperatures well below the 
binding energy A,,. Indeed, unless the exponential factor in (6.16) is very large, the 
equilibrium proton abundance X, is close to 1, while the hydrogen abundance X y is 
small. The suppression is due to both smallness of the baryon-to-photon ratio 7; and 
the fact that electrons are non-relativistic at temperatures of interest, T/Me < 1. 
In chemical equilibrium, recombination begins when Xp ~ 1 and X, ~ 1 at the 
same time. At that time the suppression by pre-exponential factors in (6.16) is 
compensated for by the exponential factor e4#/7. This occurs when A, /T is much 
greater than 1, 


eq /2 
An a — log EZ (= ) | (6.17) 


Tet 12 Me 


The latter equation is obtained by substituting X,(Tf!) ~ Xp (T1) ~ 1 into 
Eq. (6.16). Equation (6.17) determines the equilibrium recombination temperature. 
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It is useful at this point to discuss solutions of equations of the type (6.17). This 
equation belongs to the class of equations of the following form, 


x = log (Ax*). (6.18) 


In our case the unknown z is 


while parameters A and a are given by 


3 o T e 
a= A= RB) (x) 


It is important that 
log A > 1. (6.19) 


In the leading logarithmic approximation, i.e., treating log A as a large parameter, 
we can set x = 1 in the right-hand side of Eq. (6.18) and immediately obtain the 
solution 


x & log A. (6.20) 
Problem 6.2. The solution x can be written as 
x = (1 + e) log A. 


Find the correction e to the first non-trivial order in (log A)~' and show that it is 
indeed small provided that a ~ 1. 


Coming back to recombination, we obtain in the leading logarithmic approxi- 
mation 


Aun 
p , Va ( Me ) 3/2 4 
oe | — 
Savaa \An) " 

The numerical solution of (6.17) with the effective value Ĥs presented in (6.8) gives 
Ted = 0.33 eV. 

By comparing the equilibrium recombination temperature (6.21) to the temper- 
ature at equality (4.21b) we see that recombination occurs at matter-dominated 
stage. This is true also for realistic, out-of-equilibrium recombination, since the 


absence of thermal equilibrium can only delay the process as compared to the equi- 
librium situation. 


ed w 


= 0.38 eV. (6.21) 
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Let us find the age of the Universe at T9 = 0.33 eV. To this end, we use the 
relation (3.22) valid at matter domination and write 


2 2 
(8) = eT 6.22 
eo gM 69S SITATA e 


Numerically t$ ~ 330 thousand years, while the Hubble time is 
H~'(T£4) ~ 490 thousand years (6.23) 


for Te? = 0.33eV, Qu = 0.315. This estimate can be refined by making use of the 
result (4.27), but we do not need that here. 

Later on the proton abundance is small, and we can neglect the first term in the 
left-hand side of Eq. (6.15). It is then clear that the equilibrium proton abundance 


exponentially decreases as the Universe cools down, Xp « e~44/2T | More precisely 
3/4 
1/2 (M T —Ap /2T 
nyt = nel = ng (4) gauet, (6.24) 


This formula will be useful in what follows. 


Problem 6.3. Find chemical potentials He, Hp, Hu in equilibrium at temperature 
T4 = 0.33 eV. Compare them with masses of electron, proton and hydrogen atom. 


Problem 6.4. Give alternative derivation of the equilibrium recombination tem- 
perature based on kinetic approach and the relation (6.1). 


Problem 6.5. Find equilibrium recombination temperature for helium. Hint: Do 
not forget that helium nucleus has to catch two electrons to make a neutral helium 
atom. 


Problem 6.6. Find equilibrium abundances of hydrogen atoms at 2s- and 2p-levels 
at temperature T$ = 0.33 eV, relative to the abundance of atoms in the ground state 
ls. Disregard higher levels in this calculation. 


6.2. Photon Last Scattering in Real Universe 


In the previous Section we defined the “moment” of recombination as the time at 
which the equilibrium abundances of free protons and hydrogen atoms are equal. 
In fact, this is not quite the “moment” of primary interest for cosmology. Really 
interesting is the epoch at which photons scatter for the last time, and after which 
relic photons propagate freely through the Universe. We are talking here about 
Compton scattering of photons off free electrons, 


ye — ye. (6.25) 


The cross-section of this process is determined by the diagrams of quantum electro- 
dynamics (QED) shown in Fig. 6.1. We are interested in temperatures, and hence 
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Fig. 6.1. Feynman diagrams for Compton scattering. 


photon frequencies, much lower than the electron mass. In this case the Compton 
cross-section reduces to the Thomson cross-section known from classical electro- 
dynamics (the study of Compton scattering in QED can be found, e.g., in the 
book [81]; the derivation of the Thomson cross-section in classical theory is given 
in the book [83]). The Thomson cross-section is 

8T a? 


Or = — 
3 m2 


x 0.67 - 10-74 cm?. (6.26) 


The mean free time of photon with respect to the Compton scattering is 


1 


Ty = 


where ne is the number density of free electrons. At the time when the abundances of 
free protons (and hence free electrons) and hydrogen atoms are equal, np = ne ~ NH, 
the electron number density is 


NB (3) a p3 
eT Qo? BI". 
For T = 0.33eV this is equal to 
ne ~ 250 cm”. (6.28) 


The photon mean free time (6.27) in this situation equals 7, ~ 2 - 101 s ~ 6000 yrs. 
This time is much smaller than the Hubble time (6.23). Hence, photon last scattering 
happens somewhat later, at smaller density of free electrons. We will see that the 
temperature at last scattering is T, = 0.26eV, and the purpose of this Section is 
precisely to calculate this temperature. 

At temperatures of interest to us, there is kinetic equilibrium between photons, 
electrons, and also protons. We will show that in Sec. 6.3. On the contrary, chemical 
equilibrium does not hold. The latter property can be understood by noticing that 
there are very few thermal photons capable of ionizing hydrogen from the ground 
state, i.e., thermal photons with energies w > Ay. Indeed, the number density of 
photons of energies above a given value w is (hereafter the superscript eq refers to 
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thermal, equilibrium quantities) 
nêt (w) = y FS (w")dw', (6.29) 
where 

F (w) = STW’ fpi(w) 


and fp;(w) is the Planck distribution (that is the Bose-Einstein distribution for 
zero mass; we recall that photon has two polarization states). At w >> T we have 


eq w? —w/T 
F% (w) = ae ; (6.30) 
and 
2 
T 
nêt (w) = S e—/T, (6.31) 


T 


For w = Ay and T = T, = 0.26eV this number density is n$ (Ap) ~ 107° cm~”. 
So small number of energetic photons is insufficient for maintaining chemical equi- 
librium. We will see in the end of this Section that there exist more numerous 
non-thermal photons with w > Ayp; this fact will unequivocally prove that the 
reaction of recombination to the ground state is out of thermal equilibrium. 

Reactions of transition to the ground state 1s from excited states are also out of 
equilibrium. We will see that later on. On the other hand, chemical equilibrium does 
hold for free electrons and protons and hydrogen atoms at excited levels. Indeed, the 
number density of photons capable of ionizing, say, 2s- and 2p-levels is n$ (A2) ~ 
10°cm~%, where Ag, = Ay /4 is the binding energy of 2s- and 2p-levels. This 
is much greater than the baryon number density, which serves as a hint towards 
thermal equilibrium: even if all electrons and protons recombined into 2s- and 2p- 
levels, the photon spectrum in the region w ~ Ag, would not change, and would 
remain thermal. Later on we will further discuss the conditions for this partial 
chemical equilibrium; see Problem 6.8. 

So, the system is in thermal equilibrium with respect to all reactions except 
for the reactions of production and destruction of hydrogen 1s-atoms. It is useful 
to note at this point that if 1s-atoms were not produced at all, the abundances 
of excited atoms would be small, and practically all electrons would be free at 
T = T, = 0.26eV. 


Problem 6.7. Prove the last statement. Hint: Make use of the formulas of Sec. 6.1. 


The last remark shows that electrons are mostly either free or bound in 1s- 
atoms; as we discussed above, photon last scattering occurs when the number of 
free electrons in small, so most of the electrons occupy 1s states. Therefore, the 
key processes are production and destruction of 1s-atoms. In what follows, we will 
take into account 1s-, 2s- and 2p-levels only, and ignore the higher levels. This is 
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actually a reasonable approximation, since the equilibrium ratio of the occupancy 
of 3s-levels to 2s is 
M88 @—(Aze—Ass)/T 19-8, (6.32) 
N28 
Hereafter nas, ngs, etc., denote number densities of atoms at the corresponding 
levels; the atoms themselves will be denoted simply as 1s, 2s, etc. When writing 
(6.32) we used As, = Ay /9. 
Within our approximation, 1s-atoms can be produced and disappear in the 
following reactions, 


e+pols+y, (6.33) 
2s > 1s+ 24, (6.34) 
2p ls +y. (6.35) 


Let us stress that 2s — 1s transition is a two-photon process, while transitions 
from continuum and from 2p-state occur by emitting one photon. The 2p — 1s 
transition produces a photon in the first line of Lyman series, Ly-a. 

Let us begin our analysis from the transitions from continuum (6.33). These turn 
out to be basically irrelevant (!). In the situation of interest, nis Z me, a photon 
emitted in the process e + p — 1s + y ionizes a “neighboring” 1s-atom before 
losing its energy in collisions with electrons. Indeed, the ionization cross-section 
near threshold for 1s-atom is [81] 

9,2 
am l _ 63.1078 em?, 


Gis =A 
3e4 am? 


and it is much larger than the Thomson cross-section (6.26). Also, we will see in 
the end of this Section that photon redshift caused by the cosmological expansion 
is also inefficient for the reactions (6.33). Hence, the processes (6.33) effectively do 
not change the numbers of free electrons and 1s-atoms. 

Let us now consider the processes (6.34). Unlike in recombination of free elec- 
tron and proton, there are two photons produced in the direct process, so their 
appearance in the medium does not trigger the inverse reaction. The width of the 
direct process 2s — 1s + 2y is [84] 


To, asas, 


This width is small, so the direct process is slow in the sense that 2s-atom more 
likely gets ionized by a thermal photon than makes this transition; see Problem 6.8. 
This means that 2s — 1s transitions are unimportant from the viewpoint of the 
abundance of 2s-atoms relative to electrons, and there is indeed partial thermal 
equilibrium in the medium, as we discussed above. 


Problem 6.8. Show that at T > 2500 K the probability that 2s-atom gets ionized 
in the reaction 2s + 7°" — e+ p is higher than the probability that it becomes 
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1s-state due to the process 2s > 1s + 2y. Here ~t” denotes thermal photon. Hint: 
Use ionization cross section near threshold 
91672 1 97 


= — 0ls ~ 2.34 O15. 


025 = 
et 


We note that the temperature 2500 K is lower than T, = 0.26 eV = 3000 K, so the 
approximation of partial equilibrium is indeed valid at photon last scattering epoch. 
We note also that the typical time scales t S< iA are much smaller than the time 
scale of the cosmological expansion, so the expansion does not spoil the equilibrium 
for processes e+ p œ 2s + y. 


Since 2s-atoms are in equilibrium with free electrons and protons, they obey the 
Saha relation (cf. Sec. 6.1) 


— PF (6.36) 


The transition rate to 1s-level via 2s — 1s channel is determined by the number of 
2s-atoms and the width of 2s-1s transition, so the rate at which free electrons leak 
into 1s-states via this channel is 


(=) oe (6.37) 
2s—1s 
Here we neglect the inverse process which converts 1s-atom into 2s-state via absorp- 
tion of two photons. We justify this approximation in the end of this Section. 
Finally, let us discuss the processes (6.35). Similarly to free electron recombina- 
tion, Ly-a photon emitted in the direct process has high probability to get absorbed 
by “neighboring” 1s-atom in the inverse process, leaving no change in the medium. 
However, redshift of photons due to the cosmological expansion is important in the 
case (6.35). The point is that there are many Ly-a photons in the medium. Some 
of them get redshifted away from the Ly-a line and no longer participate in the 
inverse process 1s + y — 2p. This results in the overall increase of the number of 
1s-atoms and decrease of the number of free electrons and protons. We will con- 
sider this effect at quantitative level later on, and here we simply quote that the 
corresponding effective rate is numerically similar to the estimate (6.37). Hence, the 
number density of free electrons decays according to 


dne 

dt 

where Iep (T) mildly depends on temperature; see Eq. (6.50), and Teg (Tr) = 2T as. 

Making use of (6.36) we now write Eq. (6.38) as the equation for the number 
density of free electrons, 


3/2 
ane = -Tep (= ) / eR on? 
Me 


= -Tep (T) nas, (6.38) 
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Neglecting the time-dependence of the pre-exponential factor and changing variable 
from t to 1/T according to (3.34) we find the solution 


1 VegT {20 \3? a 
— = i -e T + const. 


ne AoH Tme 


The first term here grows very fast, so the integration constant is irrelevant at t ~ tp. 
Recalling that Ao, = Ag /4, we obtain finally 


AnH (Tma\’? 
-25 - ) eH (6.39) 


ne(T) 


Clearly, the result of non-equilibrium analysis is quite different from the equilibrium 
formula (6.24), as both exponential and pre-exponential factors are different. To 
avoid confusion we recall that the formula (6.39) is valid at T > 2500 K = 0.22eV. 


Problem 6.9. Show that at recombination, when ne/ng < 1, the inequilibrium 
density (6.39) always exceeds the equilibrium one (6.24); consider only temperatures 
T > 2500 K, for which the above analysis is valid. This means that the inequilibrium 
recombination is delayed as compared to the equilibrium one, as expected. 


Let us come back to the photon last scattering. The average number of collisions 
of a photon with free electrons from time t to the present time (the latter may be 
treated as infinity for this purpose) is 


Ne= f ornedt’. 
t 


Let us define the time of last scattering as the time since which photon scatters 
once, N(t,) = 1. Changing the integration variable to temperature, we have for the 
temperature of last scattering Tr, 


T, 1 
T dT AT; 
1= e= = — elI): 6.40 

f mate AT’ Bir jAg * a ( ) 


when evaluating the integral we made use of the strong dependence of the exponent 
in integrand on temperature; see (6.39). This gives the equation for Tp, 


OT ` 
Interestingly, T; depends on cosmological parameters very weakly; such a depen- 
dence exists in Tep (T) only. The result of numerical solution is 


T, = 0.26 eV. 
At the last scattering epoch, the Hubble parameter is 


H(T,) ~ Vm Ho(1 + zr)? ~ (720 thousand years)™! = (0.22 Mpc)! (6.41) 
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and the age of the Universe is tp = (2/3)H~' = 480 thousand years. The precise 
value is obtained from (4.27), which gives 


t, = 370 thousand years. (6.42) 
Problem 6.10. Obtain the estimates (6.41) and (6.42). 


Problem 6.11. Show that the Hubble rate at temperature T, = 0.26 eV, taking into 
account the contribution of radiation into the energy density, is 


H(T,.) = (650 thousand years)~' = (0.2 Mpc)’ . 
The number density of free electrons at last scattering is obtained from (6.40), 
ne(tr) ~ 30 cm™”. (6.43) 


This is considerably smaller than the electron number density at the beginning of 
recombination; see (6.28). 

Let us make a remark here. Of course, recombination does not happen instan- 
taneously: the electron abundance (6.39) smoothly decreases in time. Yet the main 
change in the number density of free electrons occurs in a small interval of temper- 
atures, 


AT & T,. 


This property is due to fast evolution of the exponential factor in (6.39). The interval 
AT can be defined by the requirement that the exponential factor in (6.39) is e times 
larger and smaller than the central value at the ends of this interval, i.e., 


A(T, + AT) 4T, 


This gives 
AT AT, 


we 


Ty An 


~ 0.1. 


Thus, the number density of free electrons falls considerably when the temperature 
decreases by several percent. In other words, the process of recombination occurs 
during a fraction of Hubble time, At < H7'(t,). 

Photons that scattered for the last time at t = t, come to us from a certain 
distance; by observing CMB we obtain a photographic image of a certain sphere 
which surrounds us. This sphere is called surface of last scattering; according to our 
discussion, it has small but finite width. 


Problem 6.12. Show that the mean free time of a photon with respect to scat- 
tering off neutral atoms exceeds the Hubble time at recombination and afterwards. 
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This means that the presence of neutral hydrogen and helium does not affect CMB 
photons. 


Let us consider the contribution of processes (6.35) into the production rate of 1s-atoms, 
or, equivalently, the rate of disappearance of free electrons. This contribution is due to the 
photon redshift caused by the cosmological expansion. The decrease of the number of Ly-a 
photons created in the direct reaction and participating in the inverse reaction is given by 
the number of photons leaving Ly-a line, i.e., crossing the low-frequency boundary of this 
line in unit time, 


~ (wx) = -w HF, (we), (6.44) 


where n+(w) is the number density of photons with energy greater than w, Fy(w) = 
dn(w)/dw is the photon spectral density (note that the equilibrium values of these quan- 
tities enter (6.29)), while w is the lower boundary of the Ly-a line; we will see that the 
precise value of w, is irrelevant for the calculation. Photons of energy below wą cannot 
excite 1s-state into 2p, so the effective increase of the number of 1s-atoms and decrease of 
the number of free electrons are given by 

dnis dne dny (wx) 


d dt dt — (6.18) 


Since the Universe expands slowly, very few photons exit the Ly-a line, so the direct 
and inverse processes are to very good approximation in thermal equilibrium. Thus, the 
spectral density Fy(w) can be calculated in the approximation of equal rates of direct and 
inverse processes, 


Tapy(w)napdw = C(w) p(w) Fy (w)nisdw. (6.46) 


The left-hand side of this formula gives the number of photons emitted in the process 
2p — 1s + y per unit time per unit volume in the frequency interval (w,w + dw), 2p is 
the width of 2p-level, p(w) is the shape of Ly-a line. The right-hand side is the absorption 
rate of photons by 1s-atoms. We used the fact that this rate is proportional to the photon 
spectral density and to the number density of 1s-atoms, so that the coefficient C depends 
on w only. It is given by 


n’ Top 
w2? ` 


Cw) = (6.47) 


Note that A(w) = Papy(w) and B(w) = C(w)y(w)/w are nothing but the Einstein coeffi- 
cients known from the theory of radiation, and the equality (6.47) is the Einstein relation, 
see the book [81]. 


Problem 6.13. Derive the relation (6.47). Hint: Make use of the fact that equality (6.46) 
is valid in thermal equilibrium and consider the low temperature regime T & w. 


The relations (6.46), (6.47) give 
2 
Fy(w) = = —. (6.48) 


This is a smooth function of w, so one can set wx = Ay — Ag, = 3Az,/4 in (6.44). Also, 
most protons are bound in 1s-atoms at last scattering epoch, so we can take nis © npg. 
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Making use of Eqs. (6.44) and (6.45) we find that the decrease of the number of free 
electrons due to reactions (6.35) is 


dne H (3An\? 
(5) , me (5) N2p- (6.49) 
p—1s 


Finally, at partial thermal equilibrium we have n2p = 3 n2s according to the degeneracy of 
the levels. Adding (6.49) to (6.37) we obtain the formula (6.38) with 
H (3An ) 


Deg = T2s +3—, (5 (6.50) 


NBT 4 
At T = T, = 0.26 eV and npg = 0.75-6.05. 1072n, the second term here equals 7.1 st, so 
that Tep (T-) = 14.3 s7! = 1.85Tis. By the end of recombination, about half of electrons 
get to 1s-level via 2s — 1s transitions, while another half of electrons get there via Ly-a 
channel. This is consistent with the complete analysis of Ref. [82]. 
An analogous phenomenon exists also in the case of the reactions (6.33). In that case 
instead of (6.46) we have 


A(w)n2 = o1(w) Fy(w)nis. (6.51) 


The rate of the direct reaction of recombination of electron and proton is proportional to 
NpNe = nz which is reflected in the left-hand side of (6.51). The rate of inverse reaction is 
proportional to the number of 1s-atoms, ionization cross-section or and photon spectral 
density at energy w, where w > Ay. Let us recall that the direct and inverse rates are equal 
in thermal equilibrium, and write the equation for the coefficient A(w) (this is nothing but 
detailed balance equation), 


A(w) (n)? = or(w) F (wnis. 


We express A(w) from this equation and substitute it into (6.51). This gives for the spectral 
density at w > Ay 


F (w). (6.52) 


Redshift of photons away from the region w > Ay is given by (6.44) with w. = Ay; the 
same formula gives the decrease of the number of free electrons due to the processes (6.33). 
The ratio of the latter contribution to that of Ly-a processes is given by the ratio of (6.52) 
and (6.48) (modulo the ratio of photon energies, Ag /(Az — A2p) = 4/3), 


dne/dt)ep—+1s FS1(Ap)nit T? Tme 8/2 _ôH 
P Py Wd e 


p tabiii tei Os commit a SS [| aT 
(dne/dt)2p—1s — (net)? A?, 2T ? 


where we inserted nop = 3n2s, made use of (6.36) and omitted factor of order 1. Let us 
now make use of (6.30), (6.24) and set nis ~ ng. We obtain finally 


(dne/dt)ep—is ~ eo at 
(dne/dt)2p—1s — : 


This ratio is small, which means that the processes (6.33) are irrelevant indeed. 

We note in passing that in our case of delayed recombination the number of free elec- 
trons ne is considerably larger than its equilibrium value n¢?, while the number of 1s-atoms 
is smaller than in equilibrium. Therefore, the actual spectral density (6.52) exceeds con- 
siderably the equilibrium spectral density at w > Aw, as we have claimed in the text. 
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Finally, let us justify the approximation which neglects the process inverse to 
2s — 1s + 2y. The inverse process occurs via absorption of two thermal photons. Its rate 
is proportional to the number of 1s-atoms, and in thermal equilibrium it must be equal to 
the rate of direct process (6.37). This immediately gives 


dne eq 1s -—(Ayq—-A T 
( = T2sn5i— = Tasnise (An 2s)/ : 
1ls—2s 


dt 


This process is negligible as compared to the direct one when its rate is smaller than the 
rate (6.37), i.e., for 


=—(Ay—Az,)/T < Nog. (6.53) 


Nise 


Nəs can be found from (6.36), (6.39), while the estimate for nis is nis © ng = feny. 
Inserting the numerical values, one obtains that (6.53) indeed holds. 


Problem 6.14. Make the numerical estimate sketched here. 


To conclude this Section, we point out that Lyman-a photons which are emitted, get 
redshifted and survive, do not lose their energy in scattering off electrons. As we discussed, 
the number density of these photons is of order npy—a ~ Ne/2 (their emission leads to 
recombination of about half of electrons). Temperatures at which they are emitted range, 
roughly speaking, from Ty? = 0,33 eV to T, = 0.26 eV. Let us compare the number of 
Lyman-a photons with the number of thermal photons in the same spectral region. We 
make use of (6.31) and write 


2 
NLy—a Ne T 


2 
wop/T A T” wop/T 
7 > e?r w 3 eels 
ny (wp) 2 w3 T fe m 


2p 

where wap = Ag —ÂA2p = 3A g /4 = 10.2 eV is energy of Lyman-a line. Even at temperature 
Ts? = 0,33 eV this ratio is of order 10, and it is even larger at lower temperatures. Thus, 
the Lyman-a region is dominated by non-thermal photons. Since their emission occurs in 
a relatively wide range of temperatures (and redshifts), the Lyman-a “line” in the CMB 
energy spectrum is fairly wide. Its relative width is estimated as follows: 


Alay: Er = Th io Gas 


W2p Ty 


Nevertheless, this “line” is in principle an interesting feature in the energy spectrum. The 
fact that it has not been detected yet is due to small number of photons in this “line”. To 
quantfy this statement, let us estimate the present brightness of the “line” (energy flux per 
unit frequency v = w/2r per unit solid angle; it is this quantity that is shown in Fig. 1.4, 
as well as in Fig. 6.2): 


NLy—a,0W2p,0 0, 5. ÑBNy,0 2TW2p 
eG 


Al = 
Ar Avp,o An Awep 


2 1 


x 6-107” erg- cm s7} . Hz" - ster™' 


=6-107? Jem ?-s !-Hz!- ster™t. 


This is much smaller than brightness that can be measured so far, see Fig. 1.4. Nevertheless, 
it is expected that future experiments will discover not only Lyman-a “line” but also other 
features in the CMB energy spectrum. The result of the calculation of the CMB energy 
spectrum [87] is shown in Fig. 6.2. 
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Fig. 6.2. Energy spectrum of CMB photons: excess of the total spectrum over the thermal spec- 
trum. We used standard atomic physics notations for hydrogen lines. “Free-bound emission” 
denotes emission in transitions from continuum; “bound-bound transitions + 2s spectrum” is 
emission in transitions between bound states and transition 2s — 1s. 


6.3. *Kinetic Equilibrium 


It is of importance to understand to what extent the electron—proton—photon plasma 
is in kinetic equilibrium before and during recombination epoch. Indeed, our pre- 
vious calculations were made under the assumption that the distribution functions 
have equilibrium forms (6.2)—(6.4) with one and the same temperature equal to the 
photon temperature. 

We will see that electron-electron and electron—proton Coulomb interations 
are most rapid processes which keep electrons and protons in kinetic equilibrium 
(Sec. 6.3.2). So, electrons and protons do have thermal distribution functions with 
one and the same temperature. 

To make sure that the temperature of electron—proton component coincides with 
the photon temperature, we have to study the following effect. We have seen in 
Sec. 2.5 that in the absence of interactions between photons and non-relativistic 
particles, the distributions of all these particles would have thermal form. However, 
the effective temperature of photons would decrease in time slower than that of 
electrons and protons. Thus, we have to check that energy transfer from photons 
to electrons and protons is sufficiently fast. We will see that this is indeed the case 
(Sec. 6.3.1). 

There is a subtlety, however, which concerns photons. The energy transfer from 
photons to the electron—proton component distorts the photon distribution function, 
while energy exchange between photons themselves is slow at recombination and 
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somewhat earlier. Thus, generally speaking, photons do not have Planckian spec- 
trum. This effect is very small, however, because the number densities of electrons 
and protons are small. So, from the viewpoint of recombination, the approximation 
of complete kinetic equilibrium is very good. 

Yet the result that the photon spectrum is not exactly Planckian is of inter- 
est, since the deviations from the thermal spectrum are potentially detectable in 
precision CMB studies. Besides the energy transfer from photons to electrons and 
protons, there are other mechanisms that distort the energy spectrum, see, e.g., 
the end of Sec. 6.2. We will see in Sec. 6.3.3 that photons get out of kinetic equi- 
librium well before recombination, at Ty kin ~ 3> 10° K. Even before that, at 
Ty,chem = 6: 10° K, photons get out of chemical equilibrium: reactions which change 
the number of photons become slower than the expansion of the Universe. Thus, 
in analogy to the sphere of last scattering, one can talk about the “black body 
photosphere of the Universe” [88] at temperature 6- 10° K and redshift 2-10°. Any 
processes at higher redshifts do not distort the CMB spectrum, while processes at 
lower redshifts, generally speaking, do lead to the spectrum distortion. 


6.3.1. Energy transfer from photons to electrons 


Let us begin with electrons. They get energy from photons via Compton scattering 
process (6.25) that occurs with Thomson cross-section (6.26). The time between 
two subsequent collisions of a given electron with photons is T = 1/(o7n,). We are 
interested, however, in the energy transfer. So, we have to find the time Tg in which 
an electron obtains kinetic energy of order T due to the Compton scattering. To 
estimate this time, we note that the typical energy transfer in a collision of a slow 
electron with a low energy photon is actually suppressed. To see this, let us write 
down the 4-momentum conservation equation in the following form, 


Pe + Py — Py = Pe, 


where pe, p+ are 4-momenta of incident electron and proton, while primed quantities 
refer to final particles. The square of this equality gives 


Me(w — w’) — |p,|(w cos 01 — w’ cos 82) — ww'(1 — cos 8) = 0, (6.54) 


where pe is 3-momentum of the initial electron, w, w’ are initial and final photon 
frequencies, 01, 02 are angles between the initial electron momentum and momenta 
of initial and final photon, p} and p/, respectively, while @ is the photon scattering 
angle; see Fig. 6.3. If the electron is initially at rest, the second term in Eq. (6.54) 
vanishes, so the typical energy transfer is 


w2 


AE =u! -w ~ —. (6.55) 


Me 
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Fig. 6.3. Kinematics of Compton scattering. 


From this we would conclude that the number of collisions after which the electron 
has energy of order T is 

T Me 
“AE T 
However, we are interested in the situation where electron kinetic energy Ee before 
the collision is comparable to the photon frequency. In this situation the electron 
3-momentum is 


N 


(6.56) 


|pe| = V2EeMme ~ Vwm. 


Hence, the third term in Eq. (6.54) is negligible, and we obtain the estimate of the 
energy transfer in a single collision, 


AE vw Z. (6.57) 
m 


We note, however, that unlike the third term in (6.54), the second term can have 
any sign: the angles 6; and @2 take random values. Therefore, we use the analogy 
with random walk and estimate the number of scattering events needed to heat up 


a moving electron, 
TA Me 
N n | —] v 
( Z) T 


This coincides with the estimate (6.56) obtained for electron originally at rest.? In 
this way we obtain the time of electron heating, 
Me 


TE(T) ~ N7(T) ~ Torn (T) 


(6.58) 


Using numerical values of parameters in this formula we find that at recombination 
Tg(T,) = 1.8- 108 s = 5.7 years, 


2 Note that for fast electron, the second term in (6.54) gives rise to systematic decrease of electron 
energy, since the number of collisions with photons moving towards electron is larger than that 
with photons “chasing” electron. This is the mechanism of thermalization of fast electron in photon 
medium. 
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which is much smaller than the Hubble time (6.42). Thus, energy transfer from 
photons to electrons is efficient, so electrons (and protons) have the photon tem- 
perature. This is true for earlier Universe too, since the time Tg rapidly decreases 
as temperature grows, Tm «x T~+. 

Energy transfer from photons to electrons leads to slight decrease in energy 
density of the photon gas. We estimate this effect by writing for the density of 
kinetic energy of electrons 


3 . 3¢(3 
Pekin = 5T Me = tip ens) dept 
TT 


Protons have the same kinetic energy. So, the rough estimate of the energy loss by 
photons is |Ap,| ~ (Pe,kin + Pp,kin), and the relative loss of photon energy is 


A 
[eal ng 1079. (6.59) 


Py 


This small loss does not affect the results of the previuos Section, but does lead to 
potentially observable CMB spectrum distortion; see Sec. 6.3.3 and review [88]. 


Let us refine the estimate (6.59). The Compton scattering keeps the kinetic energy density 
of electrons equal to pk'” = 3neT/2. The number density of electrons decreases due to 
the expansion of the Universe, ne x 1/a®. In the absence of the Compton scattering, the 
electron momentum would redshift, pe « 1/a, and the expansion would lead to the rapid 
decrease of the kinetic energy, (p2/(2m)) œ 1/a?. So the effective temperature Tepp would 
fall as Te pp x (p2/(2m)) « 1/a?; see Eq. (2.47). It is the Compton scattering that restores 
the slower dependence T œ 1/a. 

Let us denote the energy density of photons that is tansfered to electrons in time ôt 
by dpy < 0. The above observation leads to the energy balance equation 


a: De 
—dpy = Nes or—n.5( 2) 5 (6.60) 


where the right-hand side involves the changes caused solely by expansion. Since p,T œ 
1/a, one has 


. 2 2 
r= —“" Te =-vATst, Pe = 22e Hôt. 
a 2m 2m 


We insert these expressions into (6.60) and obtain 


_ 3 De _ 3 
—dpy = — (n.37- 2Neð (29) Ht = Nes THot, 


where we use the fact that (p2/(2m)) = 37/2 in kinetic equilibrium. Baryons acquire 
almost the same kinetic energy (charged baryons are in kinetic equilibrium with electrons; 
see Sec. 6.3.2), so the total energy loss by photons equals —dp4?' = (3/2)nnr HT ôt, where 
Nnr is the number density of all charged non-relativistic particles. For simplicity, we include 
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into nn, electrons and protons only,’ nn, = 2Ĥgn,. Thus, the photon energy density obeys 


õp” 3 Sia T 
py +4H py m ginr HiT = 2p pH, 
“p 


where, as usual, the second term in the left-hand side is due to the cosmological expansion. 
Note that 
3nnrT  90¢(3) 


2p = E HB = const. 
y 


This gives for the total change in the photon energy density from time t; to time t 


palt) = Prii (5) R Poi (5) | = ea em (=) l 


So, the relative change in the photon energy density due to energy transfer to electrons 


and protons is 
Apy — 90¢(3) . Ti \ i Ti 
as ae fp log Ta) = —1.1- Ñg log TE) (6.61) 


The logarithmic enhancement here is related to the fact that electrons and baryons per- 
manently lose their energy due to the cosmological expansion, and photons permanently 
heat them up. We will see in Sec. 6.3.3 that energy loss by photons distorts the CMB 
energy spectrum. We will also discuss in Sec. 6.3.3 the relevant interval of temperatures, 
which enters (6.61). 


6.3.2. Coulomb scattering: thermal equilibrium of electron-proton 
component 


Let us now discuss heating of protons. The direct interaction of proton with photons 
is irrelevant, since the corresponding time for protons is given by Eq. (6.58) but with 
Mp substituted for me. Since the Thomson cross-section is proportional to m7?, the 
time for protons is larger by a factor (mp/ me)”; this time is larger than the Hubble 
time (6.42). 

Energy transfer to protons occurs due to elastic scattering of electrons off pro- 
tons. The non-relativistic cross-section of the latter process in the proton rest frame 
is given by the Rutherford formula, 


ta? sind 


Lae 5 6.62 
m2v4 sin* 6/2 ene 


don = 
where v is the electron velocity and @ is the scattering angle. The differential 


cross-section (6.62) has power-law singularity d0/0? at small scattering angle, so 
the total cross-section diverges, and the mean free time is formally zero. However, 


3 This is not true before recombination of helium. Baryons bound in helium nuclei make up 25% of 
the total number of baryons; half of the baryons in helium are neutrons. Thus, before recombination 
of helium one has ne = ng — (0.25/2)ng œ 0.88nB, np = 0.75ngB, nye = 0.25nB/4 ~ 0.06nB, 
Nnr = 1.69ngny. This leads to the correction factor in (6.61) equal to 1.6978/fpB = 1.13. This 
correction is unimportant for our estimates. 
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we are again interested in energy transfer time Tp rather than the mean free time. 
To obtain its estimate we note that the energy transfer to proton initially at rest is 


AE = 2—2.B.(1— cos 8), (6.63) 
Mp 


where Ee is the electron energy. 
Problem 6.15. Obtain the formula (6.63). 


Like in the case of Compton scattering, energy transfer to a moving proton 
has a sign-indefinite contribution which is suppressed by y/Me/Mp rather than by 
Me/Mp as in (6.63). However, upon averaging over angles this term again has the 
same effect as the term (6.63), so we stick to (6.63). 

Making use of (6.62) and (6.63) we obtain the estimate 


do, AE (0)\~* 8neTa? oN T! 
TE ~ (re | dé 0 E = mpmev? J dé cts5 . (6.64) 
The integral in the right-hand side of (6.64) is still divergent at small angles, but now 
the divergence is logarithmic. This softening of the divergence is natural: the energy 


transfer is small at small collision angle. To obtain finite value for the integral 
in (6.64) we recall that the Coulomb interaction in plasma experiences the Debye 


screening at distance [78] 
p \12 
Fp = ; 
P ATNeQ 


Hence, the integral in (6.64) must be cut off at the scattering angle corresponding 
to impact parameter equal to rp, 


Q 


Op ~ (6.65) 


MeV?Tpo 


Problem 6.16. Making use of the Rutherford formula (6.62), obtain the estimate 
(6.65) for 0p. 


The leading contribution to the integral in (6.64) comes from small scattering angles 
0 ~ @p, so we obtain the energy transfer time 


(T) MpMev™ MpMe ( Z) i‘ (6.66) 
T. SG : 
i l6mne(T)a2log65' 16mn_.(T) a? log (6Trp/a) \me 

Numerically, 


Th(Tr) ~ 3- 10's, 


which is a very short time compared to the Hubble time at recombination (6.42). 
Hence, kinetic equilibrium between electrons and protons is an excellent approxi- 
mation. 
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The estimate for protons is valid for electrons as well, with me substituted for 
Mp and modulo a numerical fator of order 1. Combined with the results of Sec. 6.3.1 
this means that electrons and protons have equilibrium distrbution functions with 
temperature equal to photon temperature. 


6.3.3. Thermal (in)equilibrium of photons 


Similarly to electrons, kinetic equilibrium of the photon componet is due to the 
Compton scattering. The characterisctic time of energy exchange is obtained in the 
same way as in Sec. 6.3.1, but now the mean free time between the collisions of 
photons with electrons equals (orn-)~! (rather than (orn)! for electrons). As a 
result, we have for photons 


Me 


(6.67) 


E OTNeT 
This time is much greater than that of electrons (6.58) because of small value of‘ 
Ne ~ NBN. This time increases in the course of expansion as T~*, which is faster 
than the increase of the Hubble time. Therefore, photons are in kinetic equilibrium 
at high temperatures, but eventually the kinetic equilibrium breaks down. This 
happens, as we show immediately, at radiation domination, when H = T? /M5, 
where the effective number of degrees of freedom is g, = 3,36; see Sec. 4.4. By 
equating the time (6.67) to the Hubble time H~! we obtain an estimate for the 
temperature at which photons get out of kinetic equilibrium: 


nê ii 1/2 
[eS (na eee ~3-10°K. 
k (© Teens) 


This is indeed much larger than the temperature of the transition to matter domi- 
nation Teg = 0.8 eV ~ 104 K. 

It is of interest also to estimate the time at which an energetic photon of fre- 
quency T <w X Me loses a substantial part of its energy. This happens when the 
number of its collisions with electrons is of order w/AE, where energy loss in each 
collision is given by (6.55). Therefore, the time scale in queston is 


Me 


Tw ™ : 
OTNeW 


Let us apply this to a Lyman-a photon, w = 10,2 eV, which is emitted at recombi- 
nation. We obtain 


Tiya = 3 108 years , 


4More precisely, ne ~ 0.88npBn-+ before recombination of helium; see footnote 3. This subtlety is 
unimportant for our estimates. 
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Fig. 6.4. Examples of diagrams of double Compton scattering. 
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Fig. 6.5. Bremsstrahlung diagram for electron e which scatters off nucleus Z. 


which is much greater than the Hubble time at recombination. It is for this reason 
that we claimed in Sec. 6.2 that Lyman-a photons (and also photons of other lines, 
2s—1s-transitions, etc.) come to us without losing their energies. 

Compton scattering changes photon energies, but does not change the total 
number of photons. This is insufficient for complete thermal equilibrium: it needs 
processes with emission and absorption of extra photons. These are double Compton 
process, Fig. 6.4, and bremsstrahlung on protons and helium nuclei, Fig. 6.5. 

Let us make a crude estimate of the temperature at which double Compton 
processes get out of equilibrium. We assume for this purpose that incoming and 
outgoing photons have energies of order temperature. Then the diagrams of Fig. 6.4 
immediately give an estimate for the double Compton cross-section 

3 
CDO ™ 2r., 
e 
Hence, the change of the photon number density due to the double Compton pro- 
cesses is roughly estimated as follows: 


dn a? 

y 5 

(= ~ ODCNeNy ~ —7 npln,. 
DC Me 


This gives for the characteristic time 


a? =a 
5 
TDO (Ser ) a 
m 


e 
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By equating it to the Hubble time H~' = M},/T? we obtain a crude estimate for 
the temperature at which double Compton processes get out of equilibrium: 


mê 1/3 
Toc ~ | =E ~5-10°K. 
ia er) 


This estimate, however, is unrealistically high. The point is that the double Compton 
cross-section is enhanced when the energy of one outgoing photon is low [89] (see 
also Ref. [90]), w2 & wi & me: 


4a w? dw 
donc = eS — cos meer. i (6.68) 


where ĝı is the scattering angle of the energetic photon and 
2 


a 
2 
2m2 


dor = (1 + cos? 01) sin 0,d0,d¢1 

is the Thomson differential cross-section. Besides a large numerical factor (which 
is partially due to integration of the cross-section (6.68) over w with Planckian 
distribution), this leads to an effect of qualitative character. Since the cross-section 
is large at small w2, thermal equilibrium is maintained considerably longer at low 
photon energies (Rayleigh—Jeans region) as compared to the rest of the spectrum. 
Relatively fast single Compton scattering transfers photons from the Rayleigh—Jeans 
region to the main part of the spectrum where photon energy is of order T. Thus, 
the overall chemical equilibrium is maintained longer. An accurate analysis of this 
phenomenon gives [88] 


Tro = Ty chem 6+ 10° K. 


The reason for equating the estimate valid for double Compton scattering to overall 
Ty chem is that the bremsstrahlung processes are somewhat less efficient. 

To see that bremsstrahlung is subdominant, we write its differential cross-section 
for an electron moving with the initial velocity v, which scatters off nucleus of charge 
Z and emits a photon of frequency w [81]: 
16r 2703 g(v,v') dw 


3V3 Mm? vV w’ 


where v’ is the final electron velocity, v? = v? — 2w/me, and the Gaunt factor 
g (v, v’) has simple forms in limiting cases: 


do Br = 


a 


L In ge v, V > aZ 
g= , . 
T} v —o0 


Here the dependence on the photon frequency w is essentially the same as the 
dependence of the cross-section (6.68) on w2, so the above discussion of soft photons 
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applies. The ratio of times characteristic of double Compton and bremsstrahlung 
processes is of order® 


TDC ((opc)neny)~! ee 
Tar  ((oprv)Nenp)—! 1B CT , 


This ratio is small at T > 6- 10° K, so the double Compton scattering dominates 
indeed. 

To summarize, the processes that change the number of photons get out of ther- 
mal equilibrium at the highest temperature Ty chem ~ Tpc. This is the temperature 
of the black body photosphere of the Universe. Any effects that heat up or cool down 
the photon gas at T, < T S Ty chem give rise to the distortion of the observable 
CMB spectrum. These effects include the heating of electron—proton component 
which we discussed already, dissipation of inhomogeneities in energy density (Silk 
effect), and some others. These distortions will hopefully be detected in future pre- 
cision measurements of the CMB energy spectrum. An exotic possibility is decays 
of hypothetical long-lived particles (see below in this Section). 


Let us now discuss the following situaton. Consider some time before recombination and 
assume that some photons are injected at that time in a certain spectral region (for def- 
initeness, in the Wien region of high energies); we leave the physics behind this injection 
unspecified. Since electrons are quickly heated due to the Compton scattering (Sec. 6.3.1) 
and electron—baryon component rapidly thermalizes by Coulomb interactions (Sec. 6.3.2), 
its temperature becomes higher than the temperature of photons away from the injection 
spectral region. The interaction of warmer electrons with photons leads to the distortion 
of the photon energy spectrum. We are interested in the shape and amplitude of this dis- 
tortion. Note that instead of injection we could consider absorption of photons (still in the 
Wien region) or injection of energy directly into electron—baryon component. In all these 
cases the CMB spectral distortion away from the injection/absorption region is described 
in the same way. For definiteness we talk about the injection of photons into the Wien 
region. 

The results we are about to discuss are almost literally carried over from the theory 
developed by Y. Zeldovich, R. Sunyaev and collaborators for describing the heating of 
CMB photons that travel through hot ionized gas in clusters of galaxies. This is the 
Zeldovich-Sunyaev (SZ) effect [19]. In cosmological context, the results depend on the 
epoch of injection. 

Suppose first that the injection occured at temperature T; > Ty chem ~ 6- 10° K. Then 
the Coulomb and double Coulomb scattering lead to complete thermalization of the cosmic 
plasma, and the CMB spectrum remains Planckian with slightly higher temperature. The 
only observable consequence in that case is the difference between the baryon-to-photon 
ratio ņg at Big Bang Nucleosynthesis and at recombination. The only relevant parameter 
is the total energy of injected photons relative to the energy of photons already present. 
This completes the analysis of the high injection temperature, T; > Ty chem. 

The second possibility is that the injection temperature T; is in the range Ty kin < 
Ty << D chems Leya 10° K < T; < 6- 10° K. In this case, double Coulomb scattering 
and Bremsstrahlung are switched off, while single Compton scattering establishes kinetic 


5 Helium has not yet recombined by the time we discuss, so the bremsstrahlung rate is proportional 
to Ne: (np + 4nHe) = Neng. 
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equilibrium between electrons and photons without changing the number of photons. In 
this situation, the photon energy spectrum has the Bose-Einstein form with non-vanishing 
chemical potential: 


1 
fw) = Soar 7° (6.69) 
where the photon temperature is determined by the injected energy, and we introduced 
the distribution function related to the function f(p) of Sec. 5.1 as follows: 


f(w(p)) = (27)* F(p) . 


This spectral distortion is called ju-type. The chemical potential u is negative, if energy is 
injected® (there are less photons in the gas as compared to complete thermal equilibrium 
at temperature T), and, vice versa, u is positive if energy is absorbed. Note that after 
the kinetic equilibrium is established, the chemical potential decreases like temperature, 
u xa’. This follows from the conservation of the number of photons in comoving volume. 
The property u œ a + holds after recombination as well, cf. Sec. 2.5. 


Problem 6.17. Prove the last two statements. 


Problem 6.18. Show that at small u photon number density and energy density are 


— 2¢(3),,3., 1 2 
n= =) T° + 3AT ; 
2 
T 4 6¢(3) 3 
p= pe + = uS; 


Note that the first of these relations is the same as (5.21); the factor 1/2, that has to do 
with the fact that we consider the number of photons rather than the difference between 
the numbers of particles and antiparticles, is canceled out by the factor 2 coming from two 
photon polarizations. 


Problem 6.19. Let Ap be small energy density injected at temperature T; (where Ty kin < 
Ti < Tocher 

1) Making use of the results of Problem 6.18, show that after kinetic equilibrium is 
established, the photon temperature is T; +6T where ôT is related to the chemical potential 
as follows: 


T u 
T” =- = — =. 
: 18c(3)"——«2.19 


2) Find the relationship between u and Ap and show that, numerically, 


Hp gor 
T p 


6Some authors define the chemical potential with the opposite sign, see, e.g., Ref. [88]. Some other 
authors use the therm “chemical potential” for a quantity which in our notations is —u/T; see, 
e.g., Ref. [91]. 


6.3. * Kinetic Equilibrium 149 


The results of the latter Problem together with the results of Sec. 6.3.1 show that the 
energy transfer from photons to electron—baryon component in the expanding Universe 
gives rise to the spectral distortion of -type at the level 


6- 10° 


H n= . . . —_ 
F = +14- 1.1 1.13 -0.75np oe (£ OF 


) ~ 424-1079. 


(We make use of the correction factor described in footnote 3.) The dissipation of inhomo- 
geneities (Silk effect) produces spectral distortion of y-type at similar level with negative ju. 
Finally, let us consider the case when the injection occurs at Ti < Ty kin ~ 3- 10° . 
In that case one makes use of an approximate kinetic equation for f(w), which is obtained 
from the Boltzmann equation in the limit of small energy transfer in each electron—photon 
collision. This is the Kompaneets equation, and it reads in the expanding Universe 


A gut Bane 1 ð jot (nats) 


a (Oe (iin WP Oe Ow 


The derivation of the Kompaneets equation is given in the book [85]; instead of reproducing 
it here, we make a few comments. First, the second term in the left-hand side describes 
photon redshift: if the right-hand side vanishes, then the distribution function does not 
change its form, f(w,t) = flwa(t)/ai,ti], cf. Sec. 2.5. Second, one makes use of the fact 
that the electron component is thermalized at temperature Te. Third, the evolution of the 
distribution function at frequency w depends only of the properties of this function in a 
neighborhood of w, i.e., Kompaneets equation is local in w. This reflects the fact that the 
energy transfer in each collision is small. Fourth, it follows from this equation that the 
number of photons in comoving volume is conserved. Indeed this number is 


na? =a°*-2- / dete ; 
0 (27)? 


and we have for its time derivative 


1 O(a®n) Ən 
ae ð Ot. "art 


_ wdw Of Tne 1 O 4 OF 2 
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The right-hand side is actually equal to zero upon integration by parts. Fifth, the factor 
mz ' originates from the energy transfer in each electron-photon collision, which is pro- 
portional to w? /Me. Sixth, the right-hand side of the Kompaneets equation vanishes, if the 
photon distribution function has the Bose-Einstein form (6.69) with T = Te, which has 
to be the case. Finally, the three terms in the right-hand side are interpreted as follows. 
The term Teðf/ðw is related to the Doppler effect in collisions of photons with moving 
electrons (it is proportional to Te rather than ve « Te because of angular averaging, see 
Sec. 6.3.1). The second and third terms describe the retardation effects (electron gains 
and photon loses energy); the third term is proportional to f? and has to do with induced 
photon scattering. 

Since each electron collides with photons very often, the electron temperature is such 
that the electron component as a whole does not gain or lose energy (modulo small heating 
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effect in the expanding Universe). So, this temperature obeys 


oo 2 
w dw Of OF 
= — Huw} =0. 
[ mm” (3 Iw 
This equation means that the total energy density of photons evolves because of the 
cosmological expansion only. It can be written as follows: 


_1fJG+P) t dw 
“A ffwrdw ` 


Te 


Since the evolution of the photon spectrum is governed by the small parameter 
oTnel’/(meH) (see below), to the zeroth order in this parameter this evolution is solely due 
to redshift, and hence Te « a~'. Having this in mind and introducing the dimensionless 
frequency 


we write the Kompaneets equation in terms of it: 


eTe 1 
ee CoO en 


Let us come back to the photon spectral distortion caused by injection of photons at 
temperature T; obeying T, < Ti < Ty kin. In this temperature range the overall factor in 
(6.70) is small compared to the Hubble parameter, so the spectral distortion away from 
the injection region is also small. Photons before injection have Planckian distribution at 
temperature T, 


1 
jeier) = Se 
where 
oe Ani te. 
= ee ae 


note that Te > T in the case of injection because of fast electon heating. Therefore, 
Friler) _ Te Ofri(er) 
Ox = T Oar ’ 
and the following identity holds: 
_ Fe(rr) 
Orr 


So, to the leading order in (Te — T) the right-hand side of Eq. (6.70) is 


ornele [{ Te 
Pree (22 24) BF 
Me ( T ) (er), 


fe(er) + fri(er) = 


where 


— 1 ð 4 Of pi(x) _ re” er +1 _ 
ra= Sgh Be )= er (FS 4). 


(The difference between x and gr is irrelevant here.) We now write 


f=frit of, 
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Fig. 6.6. CMB spectral distortion of y-type [88]. 


and finally obtain the CMB spectral distortion 


ðf = yF (w/T) , 


tr OTNeT (Te Ti orne (Te 
=f dt =e (F-1)-f ar “the (F1). (6.71) 
The spectral distortion has a universal shape (away from the injection region), and the 
only relevant parameter is its amplitude y. Such a distortion is called y-type; see Fig. 6.6. 
Since ne x T?, and H « T? (H œ T°/?) at radiation domination (matter domination), 
the integral in (6.71) is saturated near the injection temperature T;. The higher T;, the 
stronger the distortion at given T./T, i.e., at given Ap/p. Distortions of y-type caused 
by the heating of electron—proton component and Silk effect are expected at the level 
y=10-%—107°. 
Genuine CMB spectral distortions have not been detected yet (the qualification has 
to do with Zeldovich—-Sunyaev effect). The bound on the distortion of y-type is set by the 
analysis of the data from FIRAS instrument [92]: 


where 


ly] S1.5-107° 


at 95% C.L. The bound on the distortion of p-type is 


| —4 
T, < 0.9.10 
at 95% C.L. 

The distortion of the CMB energy spectrum is possible in extensions of the Standard 
Model of particle physics in which new particles decay into charged Standard Model par- 
ticles and/or photons at relatively late time, z < 107”. Candidates to these new particles 
are neutralino, charged sleptons, gravitino in supersymmetric extensions of the Standard 
Model, and also sterile neutrinos and others. Among even more exotic candidates are par- 
ticles with very small electric charge. The negative results of search for CMB spectral 
distortion places limits on the parameters of the extensions of the Standard Model. As 
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Fig. 6.7. Allowed region in the parameter plane (mxnx/ny,tx) for a model with new non- 
relativistic particles of mass mx, initial number density nx and lifetime tx, which decay into 
photons [93]. The region above the solid line is ruled out by search for CMB spectral distortion; 
other lines are obtained within various analytical approximations. We note that current bounds 
on parameters y and u are somewhat stronger than those used here; they are given in the text. 


an example, we show in Fig. 6.7 the limit valid for non-relativistic particles of mass mx, 
initial number density nx and lifetime tx which decay into photons [93]. 


Problem 6.20. Consider a model with new light stable particles X,X which have very 
small electric charge ce, € < 1, where e is the electron charge. Assuming that they are pair 
created in electromagnetic interactions in cosmic plasma and that the CMB spectrum is 
Planckian at the level ~ 107%, find the allowed region in the parameter plane (e, mx). 


6.4. Horizon at Recombination and its Present Angular Size. 
Spatial Flatness of the Universe 


CMB photons, which last scattered at recombination, give us a photographic picture 
of the Universe at temperature T, = 0.26eV and redshift z, = 1100. The character- 
istic length scale at recombination epoch is the horizon size at that time, lar. This 
scale is of interest for two reasons. First, in the Hot Big Bang theory, regions sepa- 
rated at recombination epoch by distance exceeding la r were causally disconnected 
at that epoch. Second, this scale is imprinted in the properties of CMB photons. 
We show in the accompanying book that CMB anisotropy and polarization have 
features in their spectra whose angular sizes are determined, roughly speaking, by 
the angle at which the horizon at recombination is seen today. In this Section we 
give the preliminary estimate of this angle and discuss its dependence on cosmo- 
logical parameters. Our purpose here is to give a general idea of how cosmological 


6.4. Horizon at Recombination and its Present Angular Size. 153 


parameters are determined from CMB measurements and other observations; this 
is discussed in more detail in the accompanying book. 

In fact, more relevant for CMB is the sound horizon at recombination (4.52). 
This follows from the discussion in Secs. 1.7.1 and 4.7; we will heavily use the 
notion of the sound horizon in accompanying book. The sound velocity depends 
on the baryon density, see (5.38), so the sound horizon size depends on 7g. This 
parameter, however, is well known from the analysis of Big Bang Nucleosynthesis 
(see Chapter 8) and CMB measurements, so there is basically one-to-one corre- 
spondence between the sound horizon and particle horizon. In this Section we will 
consider particle horizon la, r, having in mind this qualification. 

As we have noticed already, recombination occurs at matter dominated epoch. 
So, we estimate the horizon size at recombination by making use of Eq. (3.26): We 
write 

2 2 1 
He Hom (1+ 2r)3/?? 


where H, = H(t,) is the Hubble parameter at recombination. Precise value of lz, 
is obtained by making use of the result (4.29), which can be written as 


2 Í 
Hom (1+ 2,)8/?’ 


1/1lte\¥ T, \ "2 T, \ "2 
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Numerically f = 0.56 for Qm = 0.315. 


lu,r = 


(6.72) 


lur = a(tr)n(tr) (6.73) 


where 


Problem 6.21. Neglecting the contribution of radiation, show that the Hubble 
parameter at recombination can be written as follows: 
16¢(3) Qm T? B 


H(T,) = (= 
a ( 3r Op "PM, 


where mp is proton mass. Thus, the apparent dependence of the expression (6.72) 
on the parameters of the present Universe is actually not there. 


The length interval (6.73) has stretched (1 + zr) times since recombination, so 
its present size is 


2 i 
Hov Qm VIF zr 


Clearly, this size is about VI + z, ~ 30 times smaller than the present horizon size 
(4.35); a more precise estimate gives 50 instead of 30. In other words, the Universe 
visible at present contains about 10° regions that were causally disconnected by 
recombination. (Of course, this is true in the framework of the Hot Big Bang theory 


la r(to) = 
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only.) Nevertheless, these regions were exactly the same, we know that from both 
CMB observations and galaxy surveys. How did it happen that despite the absence 
of causal contact the different regions have exactly the same properties? This ques- 
tion cannot be answered within the Hot Big Bang theory; this is a problem for this 
theory called horizon problem. The horizon problem finds its elegant solution in 
inflationary theory. 

Let us calculate the present angular size of a region whose spatial size at recom- 
bination was equal to la, r. Like in Sec. 4.6, we do not assume that the Universe is 
spatially flat: as we discuss in this Section at the qualitative level (and quantitatively 
in the accompanying book), it is the calculation of this sort and its comparison with 
CMB data that lead to the result that the spatial curvature is close to zero. We 
assume for simplicity that dark energy density p, is constant in time (cosmological 
constant); the generalization to time-dependent dark energy is straightforward. 

Like in Sec. 4.6, let us choose the open cosmological model (x = —1, Qeurv > 0), 
whose metric is given by (2.10). The last scattering surface is at the coordinate 


distance 
to dt 
— f Ba 6.74 
a a ie 


CMB photons traveled precisely this distance since recombination. This coordinate 
distance is still given by Eq. (4.40), where we have to set z = zp. Since zp >> 1, the 
integration in (4.40) can be extended to zr = oo, which corresponds to the limit 
tr — 0 in the integral (6.74). In physical terms this means that we neglect the 
difference between the distance that photon traveled since recombination and the 
present horizon size. Hence, 


œ dz al 
Xr = Xm, = J TEE (6.75) 
0 


ao Ho Rull + 2)? FO, + Qeurv(l + 2)?’ 


which is the coordinate size of the present horizon. Making use of the results of 
Sec. 4.7, we write the angular size, 


lär 
A90, z 


where ra(zr) = (1+ 2-)~++ao-sinhy, is the angular diameter distance to recombi- 
nation. We finally obtain 


2 f 
o/h a0H sinh xr Vzr +1 


When discussing this formula, one should bear in mind the relations (4.45) and 
(4.46). 

Let us begin with the hypothetical case of spatially flat Universe without dark 
energy, Qeury = 0, Qa = 0. Spatially flat Universe corresponds to the limit ao — oo, 


A9, (6.76) 
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which gives 


f 
Ab, = ===, Ney = Na = 0. 6.77 
zr +l i en) 
Of course, this result would be easier to obtain by using directly the formulas for 


the Universe filled with matter, given in Sec. 3.2.1. 


Problem 6.22. Obtain the formula (6.77) directly in the model of spatially flat 
Universe filled with non-relativistic matter. 


For z, = 1100 the formula (6.77) gives A0, = 0.017, or A@, = 1°. A manifes- 
tation of the horizon problem is that photons coming from directions separated by 
more than 1° were emitted in causally disconnected regions, and yet they have the 
same temperature within 0.01%. 

Continuing the discussion of formula (6.76) we recall that the angle’ A0, deter- 
mines the angular scale of the features in the CMB angular spectrum. Hence, the 
angle A0, is a measurable quantity. It is therefore of interest to consider the depen- 
dence of A@, on cosmological parameters. Since zp is known, there are only two 
relevant parameters:® due to relations (4.45) and (4.46) this pair may be chosen 
as (Qm, Qeurv). To see which parameter is more relevant (in the sense that the 
dependence on it is stronger), let us first consider the case of spatially flat Universe, 
Qeurv = 0. Unlike in the case (6.77), we now have Q, 4 0 and therefore Qu 4 1. 
We take the limit ao — oo, and the expression (6.76) becomes 


aud 1 
Aar FILI) 


Ab, Ouro = 0; (6.78) 
where 


I= 


TOT dz 
: f (6.79) 
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and Q, = 1 — Qu. Changing the variable to (1 + z) = y7? we write this integral in 


the following form, 
1 
0 A 
IH ys 
Du 


It is clear that if Qw is not very small, this integral depends on Q,/Q,, rather 
weakly. At Qm = 1, Q, = 0 it is equal to 1, while for Q,, = 0.315, OQ, = 0.685 its 


(6.80) 


TMore precisely, similar angle determined by the sound horizon; we talk about the angle A8, for 
definiteness, while the discussion applies to the observable angle as well. 

8In the realistic case, there is some dependence on other parameters, but one of these parame- 
ters, baryon-to-photon ratio, is known from other observations while dependence on others (e.g., 
neutrino masses) is weak. 
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value is 0.91. We conclude that the dependence of the angle A@, on the distribution 
of energy between Qw and Q, is rather weak. 

On the other hand, the angle A0, depends on Qecury quite strongly. To see this, 
let us consider the hypothetical case Q, = 0, when Qy, + Qeury = 1. In that case 
the change of variables (1 + z) = y~? gives the analytic expression for the integral 
(6.75), namely 


Qeurv 
Xr = 2Arsinh 2 Q, =0, 


M 


where we made use of (4.46). The angle A0, is then given by 
f 1 
Vzr +1 V 1+ Goure Oey 


Clearly, at Qeury ~ Qu this result is quite different from the results (6.77) and 
(6.78) which are valid for the flat Universe. It becomes clear that the measurement 
of A@,. (more precisely, angular scales related to it) gives strong bound on the spatial 
curvature of the Universe. 

Finally, let us consider the case when Qeury is small compared to both Q, and 
Qa, while Qw is roughly comparable to Q,. Then one can neglect Qeury in the 
square root in (6.75) (this corresponds to small contribution of curvature into the 
Friedmann equation, see the discussion in Sec. 4.2), and using (4.46) we obtain 


Qeurv 
n= 24/ GE (Oar Ma); 
M 


where J is the same integral (6.79) or (6.80), but Q, is no longer equal to (1 — Qw). 
Using (6.76) again, we find 


7 F 24/ Qeurv/Qu 
Aa, = = > V@§ ——. (6.81) 
Vir + 1 sinh (2 Deure/ ae) 


The spatial curvature shows up here through the fact that the denominator contains 
hyperbolic sine rather than linear function; the angle at which a given interval is 
seen in hyperboloid is smaller than the angle in Euclidean space. The dependence of 
the right-hand side of (6.81) on Qeurv/Nm is rather strong, unlike the dependence on 
Q,/Qy,. This reiterates our conclusion that A0, is particularly sensitive to spatial 
curvature. Let us note that we again encounter the degeneracy in parameters, as 
we did in Sec. 4.6, but now the less important parameter is Q,/Qy,. 

The first measurents of the CMB angular anisotropy at relatively small angular 
scales (about an angular degree) already gave rise to the conclusion that the spatial 
curvature of our Universe is small. Together with the SNe Ia observations they lead 
to the ACDM model as the working hypothesis (see Fig. 6.8). Later data are not 
(yet?) in contradiction with the ACDM model, they rather pin down its parameters 
further. This is illustrated in Fig. 6.9. 
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Fig. 6.8. Early bounds on cosmological parameters from the analysis [86] of the CMB anisotropy 
data together with SNe Ia observations. Regarding contours, see comment to Fig. 4.9 given before 
Problem 4.9. 
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Fig. 6.9. Allowed region in the space of parameters (Qm, QA) from CMB anisotropy data 
(+TE+EE, lensing) and data on baryon acoustic oscilations (BAO), the preferable values of the 
present Hubble parameter Ho are indicated in color [94]. Shown here are parameter regions allowed 
at 68 %-95 % C.L. See Fig. 13.14 for color version. 


Problem 6.23. Calculate the integral (6.75) numerically and draw the lines of 
constant Af, on the plane (Qa, Qumu). Compare with Fig. 6.8. Hint: Use the relations 
(4.45), (4.46). 


Problem 6.24. Find the size of the sound horizon at recombination. Hint: recall 
that the sound speed in the baryon-photon medium depends on time through the 
ratio pg/py, Eq. (5.38). At what angle is this sound horizon seen today? 


Chapter 7 


Relic Neutrinos 


The earlier the period in the history of the Universe, the higher the temperature 
and density of matter. Hence, those processes which are too weak to be relevant in 
the present Universe, are important at earlier stages and may leave imprints in the 
Universe we see today. 

We have discussed the phenomenon of this sort in Chapter 6 in the context of 
relic photons. Here we move further back in time and consider other light particles, 
neutrinos. 


7.1. Neutrino Freeze-Out Temperature 


Let us estimate the temperature at which neutrino interactions between themselves 
and with cosmic plasma switch off (“freeze-out temperature” ). We will see that this 
temperature is of the order of a few MeV. At this epoch, electrons and positrons are 
still relativistic, and their number density is given by (5.16b). On the other hand, 
baryons are non-relativistic, so their abundance is suppressed by a factor of order 
ns relative to the abundance of ete~-pairs. Hence, as far as neutrino freeze-out 
is concerned, relevant processes are neutrino scattering off electrons, positrons and 
themselves and neutrino-antineutrino annihilation into e*e~-pair and neutrino— 
antineutrino pair of different type, as well as inverse processes. In all these processes, 
particles are relativistic at temperatures of interest. 

We will not need the precise value of the neutrino freeze-out temperature in what 
follows. So, a dimensional estimate for the cross-sections of the above processes is 
sufficient for our purposes. Neutrinos participate in weak interactions only (see 
Appendix C). At energies of interest, the cross-sections are proportional to the 
Fermi constant squared G3, where 


Gr =1.17-107> GeV~?. 


On dimensional grounds, we immediately obtain for the cross-section of any of the 
above processes at E > me, 


oy ~ GLE’, 
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where FE is a typical collision energy, E ~ T. 
The mean free time of neutrino is given, as usual, by 


1 


aT (apnv)’ 


(7.1) 
where v is the relative velocity of neutrino and a particle it collides with, and n is 
the number density of the latter particles. In our case all particles are relativistic, 
the number density is given by (5.16), i.e., n ~ T? and v œ 1. In this way we come 
to the estimate of the mean free time, 


1 
Ty Y =. (7.2) 
GATS 
We now compare 7, with the Hubble time (see (3.31)) 
= My 
go = Mb, (7.3) 


We see that as the Universe cools down, 7, increases faster than H~'. Thus, in 
accordance with our expectation, at early stages the mean free time is shorter than 
the Hubble time, and neutrinos are in thermal equilibrium with matter. Indeed, the 
number of neutrino collisions since time t is estimated as 


S dt! e t t 1 
N(t)~ = L anw Ie anwa, 
t T(t) t t y(t’) T(t) H (t)r (t) 
where we made use of the fact that 
t 1 


Tult)  HA(t)r(t) 
rapidly decreases in time. If N(t) >> 1 then neutrinos are in thermal equilibrium, 
while for N(t) < 1 they are non-interacting. Hence, neutrino interactions switch 
off at 


T(T)~ H7! (T). 


It follows from (7.2) and (7.3) that this happens at freeze-out temperature! 


1 1/3 
Typ ~ (az) ~2—3MeV. 
G2 M5; 


Problem 7.1. Estimate the age of the Universe at neutrino freeze-out. 


1 This temperature weakly depends on the neutrino flavor: electron neutrinos interact with electrons 
and positrons via exchange of both W=- and Z-bosons, while muon and tau neutrinos participate 
only in reactions with Z-boson exchange. Therefore, the annihilation and scattering cross-sections 
are somewhat smaller for v, and vy; as compared to ve, and Vy and vr freeze out somewhat earlier. 
Also, neutrino oscillations play a role. All these subtleties are unimportant here. 
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Thus, at freeze-out temperature T, ¢ neutrinos interacted for the last time. Since 
then they freely propagate through the Universe and their number in comoving 
volume does not change. The very assumption that there existed temperatures of 
the order of a few MeV leads to the conclusion that the present Universe contains the 
gas of relic neutrinos analogous to the gas of relic photons, CMB. We will see that 
the latter assumption is strongly supported by Big Bang Nucleosynthesis theory and 
observations, and also other data, so there is no doubt that relic neutrinos indeed 
exist. 


7.2. Effective Neutrino Temperature. Cosmological Bound 
on Neutrino Mass 


It follows from the results of Sec. 2.5 that neutrinos after freeze-out are still described 
by relativistic distribution function with present effective temperature 


a(t) Tog 
Ty = Ty = BE 
a uf alto) 1+ 


(7.4) 


where z, is the redshift at neutrino freeze-out. At the time of freeze-out, neutrino 
temperature equals to that of photons. After freeze-out, the photon temperature 
also decreases due to the cosmological expansion. However, at neutrino freeze-out, 
the cosmic plasma contained also a lot of relativistic electrons and positrons. As 
the temperature drops below the electron mass, electrons and positrons annihilate 
away, injecting energy into the photon component. Due to this process, the photon 
temperature becomes higher than the effective neutrino temperature. This effect 
is quantitatively described by making use of entropy conservation of the electron- 
photon component in comoving volume, 


g21(T)a®T? = const, (7.5) 


where g% (T) is the effective number of relativistic degrees of freedom in the 
electron—photon plasma, see (5.28) and (5.30). Right after neutrino freeze-out, this 
plasma consists of relativistic electrons, positrons and photons, which gives 


7 11 
go. (Tuf) = 2+ z2 +2) = 


After et e~-annihilation, the entropy is due to photons only, gf’ = 2, and we arrive 
at the following result for temperatures well below electron mass (including present 
epoch), 


T. em ay pie 
ec (S) = (+) ~ 14, (7.6) 
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We conclude that the present neutrino temperature is? 
T(to) ~ 1.95 K. (7.7) 


Making use of (5.16) we find the present number density of each species of neutrinos 
together with its antineutrinos, 
30, (3) 


Myo = 52: 


i ag Ty (to) = 112 cm”. (7.8) 


Problem 7.2. Let us make a wrong assumption that there is no Z°-boson in Nature, 


while W~ bosons exist. Furthermore, let us neglect loop processes and neutrino 
oscillations (which is also wrong). What are the relic abundances of neutrinos of 
different types? 


Direct detection of relic neutrinos is unsolved and is a very hard problem, in 
view of extremely small cross-section of low energy neutrino interactions and tiny 
energy release. 


Problem 7.3. Assuming that neutrinos are massless, estimate the mass of a detec- 
tor required for having one relic neutrino interaction per year. 


Neutrino oscillation data are consistent with the possibility that one neutrino 
species is massless. Contribution of massless neutrino to the present energy density 
would be small. Making use of (5.12) we would find for one type of neutrino plus 
its antineutrino, 


so massless neutrinos would make very little effect on the expansion of the present 
Universe. It is worth noting that this was not always the case: neutrino contribu- 
tion was sizeable at radiation domination, and, indeed, Big Bang Nucleosynthesis 
imposes strong bound on the number of neutrino species, as we discuss in Chapter 8. 

The situation is quite different for neutrinos of mass m, > T, o. Their present 
energy density is 


Pv,0 = MpNyp,0; 


and the relative contribution to the total energy density is 


— Pv,0 ~~ ( My 
Pe 1 eV 


Q, ) 0.01 n-?. (7.9) 


Let us require that the neutrino energy density does not exceed the total energy 
density of non-relativistic matter. Recalling that there are three types of neutrinos, 


Let us recall that massive neutrinos today have relativistic distribution in momenta with effective 
temperature (7.6). Their abundance is independent of their mass and is given by (5.16). 
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we obtain the cosmological bound on the sum of their masses [98] 


? 


XO my, < 100- h?°Qu eV. (7.10) 


With conservative bound of the early 1990s, Qu < 0.4 and h = 0.7, we obtain 
XO my, < 20 eV. 


For a long time a similar bound was the strongest bound on the masses of u- and 
7-neutrino. Presently one can combine the direct limit on electron neutrino mass [1], 
my, < 2 eV, with the results of neutrino oscillation experiments. The latter show 
that the differences of masses squared Am? between ve, V, and v, are small, Am? < 
5-107? eV?. So, all types of neutrinos have low masses, 


my <2eV, (7.11) 


see Appendix C. The bound (7.11) and the relation (7.9) show that the contribution 
of all types of neutrinos into energy density is not very large, 


XC Qu, < 0.12. (7.12) 


Nevertheless, by comparing this to the relative energy density of all non-relativistic 
matter, Qu =œ 0.3, we see that the bound (7.12) per se allows neutrinos to be 
substantial part of dark matter. However, the data on cosmic structures and CMB 
exclude the relative contribution of neutrinos at the level 


XC 0,,h? < 0.002—0.005, (7.13) 
depending on details of the analysis, see Fig. 7.1. This corresponds to the bound on 
the sum of neutrino masses [94, 99] 


XO m, < 0.2-0.5 eV (7.14) 


and rules out neutrino as a dark matter candidate. We discuss the origin of the 
bound (7.13) in the accompanying book. 

It is worth noting that neutrino oscillation data (Appendix C) tell that the 
differences in neutrino masses squared are small: the largest of them equals m2,,,, 
(0.05 eV)’. Therefore, the limit (7.14) means that the mass of each type of neutrino 


is bounded from above: 


w 
~ 


my, < 0.2 eV. (7.15) 


This is substantially stronger than the direct experimental bound (7.11). 

To end this Section we note that the above results have been obtained under the 
assumption that there is essentially no asymmetry between neutrinos and antineu- 
trinos in the Universe. In other words, we assumed that the neutrino chemical 
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Fig. 7.1. Limits on the sum of neutrino masses obtained under various assumptions on other 
cosmological parameters and using different cosmological datasets [104]; Ho (left) is the present 
value of the Hubble parameter, Ne yy (right) is the effective number of neutrino species. See 
Fig. 13.15 for color version. 


potential is close to zero. This assumption is quite plausible, especially in view 
of the observation that electroweak processes at T = 100 GeV equalize, modulo 
factor of order 1, lepton and baryon asymmetries (see Chapter 11), and that the 
baryon asymmetry is very small, 7, ~ 107°. Nevertheless, one cannot rule out 
completely the possibility of substantial neutrino—antineutrino asymmetry. In that 
case, neutrino oscillation data (in particular, lower bound on the mass of the heav- 
iest neutrino, My > Matm % 0.05 eV), together with the bound (7.13) can be used 
for placing the bound on the lepton asymmetry in the present and early Universe. 
This is not a particularly strong bound, and we leave it for the problem. 


Problem 7.4. Making use of the neutrino oscillation results (Appendix C) and 
the bound (7.13), obtain the bound on neutrino asymmetry in the present Universe 
defined as 


Axo = DAUA — no) 


S 


where s is entropy density. 


Stronger bound on the lepton asymmetry is obtained from Big Bang Nucleo- 
synthesis. In Sec. 8.1 we obtain the bound on the chemical potential of electron 
neutrino at temperature of order 1 MeV at the level |, /T| < 0.025. Precise bound 
(at 68% confidence level) is 


—0.023 < re <0.014, T~1MeV. (7.16) 
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In fact, this bound applies to all types of neutrinos, since neutrino oscillations 
equalize neutrino abundances before neutrino freeze-out. Indeeed, the typical time 
for neutrino oscillations at energy FE is (see Appendix C) 


4E 
Am2 
At E ~ 3T, T ~ 3 MeV and even for the smallest value Am2, ~ 8- 1075 eV? this 


sol — 


losa = m 


time is 
tose ~ 5- 1074s, 


which is much shorter than the Hubble time at freeze-out, H~!(T ~ 3 MeV) ~ 0.1s. 
Note that we neglected matter effects in our estimate; this is indeed a reasonable 
approximation at temperature of about 3 MeV (see [95, 96] for details). Hence, at 
neutrino freeze-out 


Hve = Hv, = Hvr: T ~ 3 MeV. 


This means that the bound (7.16) indeed applies to all types of neutrinos. 
Neutrinos excess over antineutrinos is given by (5.22), and the neutrino number 
density itself by (5.16b). So, for each type of neutrinos we have 


2 
Ny — Np T Ly 


notno 9C(3) T 


This asymmetry persists until the present time, so we find from (7.16) that for each 
type of neutrinos 


We conclude that the excess of neutrinos over antineutrinos (or vice versa) is very 
small in our Universe. 


7.3. Sterile Neutrinos as Dark Matter Candidates 


The existence of neutrino oscillations points towards incompleteness of the Stan- 
dard Model of particle physics (see Appendix C for details). Some extensions of the 
Standard Model include new particles, sterile neutrinos. These may be described as 
Majorana fermions which mix with conventional neutrinos. They are sterile in the 
sense that the new fields do not interact with gauge fields of the Standard Model; 
in particular, they do not directly participate in weak interactions. Conventional 
neutrinos, interacting with W- and Z-bosons, are called active in this context. The 
total number of neutrino states is then equal to 3 + N, where N, is the number of 
sterile neutrino species. We will restrict ourselves to the case N, = 1 for definite- 
ness. We note here that sterile neutrinos have not been discovered (yet?), although 
there are anomalies (at not so high confidence levels) in various neutrino oscillation 
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experiments which may be interpreted in terms of sterile neutrinos. In any case, 
such an extension of the Standard Models is quite reasonable and worth discussing. 

We are going to consider the relatively light sterile neutrino, ms < 100 MeV. 
In the simplest models [100-102] the creation of sterile neutrino states |vs) in the 
early Universe occurs due to their mixing with active neutrinos |v.), & = e, y, T. In 
the approximation of mixing between two states only, we have 


va) = cos ĝa |v) + sin falv), |vs) = —sin@,|1) + cos balva), 


where |v.) and |vs) are active and sterile neutrino states, |v,) and |v2) are mass 
eigenstates of masses m1 < M2, and 6, is the vacuum mixing angle between sterile 
and active neutrino. Let us assume that mixing is weak, 0. < 1, so that the heavy 
state is mostly sterile neutrino |v2) ~ |v,). In this situation the mass of the heavy 
state is naturally called the sterile neutrino mass, mz = Ms. Let us assume that the 
sterile neutrino mass is large compared to the mass of active neutrino, Ms > mı. 
All these assumptions are very natural from particle physics viewpoint. 

The calculation of probability of oscillation Va +> Vs is done in the same way as 
in the case of oscillation between different types of active neutrinos (see Sec. C.1). 
For relativistic neutrino, E, >> ms, the probability of the transition Va —> vs in 
time ¢ in vacuo is 


P(Va > Vs) = sin? 20, + sin? (==) , 
n 2 (7.17) 
i AnA? Am? = m? — m? ~ m?. 


Cosmic plasma effects, however, are quite important, especially at high tempera- 
tures. The plasma affects the propagation of active neutrino |v,,), so the Hamiltonian 
in the active-sterile basis (|v), |vs)) is 


2 2 
H =U -diag (Se 


): UT + Vint, (7.18) 


where the mixing matrix U and matrix Vint describing matter effects are 


g= ieee a Vin = E ni 

— sina cos, 0 0 

The quantity Vaa can be calculated for Va = Ve, Vu, Vr by making use of the meth- 
ods of quantum field theory at finite temperatures. These methods are presented 
in Appendix D. Assuming the absence of lepton asymmetry, one finds that the 
contributions to Vxa cancel out to the first order in G. Most important are the con- 
tributions of the second order in Gpr. At interesting temperatures, T < 100 MeV, 
there are no T-leptons and few muons in the medium, but there are numerous rela- 
tivistic electrons and positrons. For this reason, Vya are quite different for different 


7.8. Sterile Neutrinos as Dark Matter Candidates 167 


types of neutrinos. One has the following expression [103] for v+, 


V7 = - sin? Oy cos? Oy «G2T* - E, = —25-G27T* «By. 
The numerical coefficient is 3.5 times larger for electron neutrino, while the 
corrsponding factor for muon neutrino is between 1 and 3.5 depending on the ratio 
of temperature to muon mass. 

By diagonalizing the Hamiltonian (7.18) one obtains effective masses and mixing 
angles of neutrinos in medium, which are different from those in vacuo. As a result, 
the oscillation probability is given by the formula similar to (7.17) but with different 
mixing angle 0®®t and period of oscillations ¢@**, 


t 
P(Vq > Vs) = sin? 20™** . sin? (z=) ; (7.19a) 
mat tas 
gmat — o (7.19b) 
sin? 204 + (cos 20a — Vaa « t¥4¢)? 
{mat 
sin 297" = TE -sin 209. (7.19c) 


Note that Vxa has negative sign, so there is no resonance: sin 20™** monotonously 
increases in time and tends to the vacuum value sin 20a: mixing in plasma is sup- 
pressed with respect to that in vacuum. At interesting temperatures, when the 
sterile neutrino mass is in the range mı < Ms < T, we have the following estimate 


for the time of oscillations in the medium, 
gmat — min(¢?°, |Vao| +) = min(@T'm,”, (0.04 — 0.01) - T7 - G5”). (7.20) 


The typical oscillation time is not only smaller than the Hubble time H~1(T), but 
also smaller than the typical time of weak interactions 7, given by (7.2). 

Scattering of active neutrino leads to the collapse of its wave function. Hence, 
during time 7, active and sterile neutrinos oscillate, and at the moment of scattering 
the coherence is destroyed. Every active neutrino Va oscillates into sterile neutrino 
vs many times before it collides with a particle in plasma. Therefore, the probability 
that after time 7, this neutrino becomes the sterile neutrino is 


1 
(P(va > Vs)) = 5 - sin? 209%, (7.21) 


where we performed averaging of (7.19a) over several oscillation periods. At temper- 
atures T > 3 MeV, when active neutrinos are in thermal equilibrium with plasma, 
this process does not reduce the number of active neutrinos but it leads to pro- 
duction of sterile neutrinos. The production rate per unit volume [',_,, equals the 
product of averaged oscillation probability (7.21) and half of the rate? of active 


3This factor 1/2 is a result of accurate calculation [97]. Yet it admits heuristic interpretation. If 
both active and sterile neutrinos interacted with plasma at the same rate, then this rate would be 
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neutrino interactions in plasma Ta = 7;,": 


Tass = sa -(P(Uq > Vs)). (7.22) 


Although we are interested here in neutrinos with energy E ~ T, let us write for 
future reference the expression for Ia in the case of r neutrino with energy E [103] 


7 
T,= S err (7.23) 


Thus, the production rate of sterile neutrinos per unit volume is approximately 
given by 


Vaseny, a ae g (P(va => Vs)) Nye: 


This gives for the density of sterile neutrinos n,, (assuming that they do not decay 
and neglecting the inverse process of their oscillation into active neutrinos) 


dny 
Tie + 3Hnu, = T7 (Pla = Vs) Mg, (7.24) 


where the second term in the left-hand side is due to the cosmological expansion. 
Equation (7.24) is conveniently written as the equation for the sterile-to-active ratio, 
d(nv, /Nva) ~~ (P(va > vs)) (7.25) 
dlog T H (TYT, 
where we use the argument T instead of t and disregard the temperature dependence 
of the effective number of degrees of freedom g+. The temperature dependence of the 
mixing angle in matter 0™** is strong, so the right-hand side of Eq. (7.25) behaves 
as T’ at low temperatures and as T7? at high temperatures. This implies that the 
production of sterile neutrinos takes place mostly in a narrow temperature interval 
around some critical temperature Tą. The latter is determined from the require- 
ment that the right-hand side of (7.25) is at maximum, i.e., the two expressions in 
parenthesis in (7.20) are of the same order of magnitude. This gives 


T, ms \ 200M mag 2 
s~ ) ~ 200 ev- (=) . (7.26) 


As the sterile neutrino production rate has strong power-law dependence on tem- 
perature, their number density is estimated as 


ny, (Tx) sin? 28a 5 Be, ee 
— so P rere T. Mz i 20a 
tT) HO) na Open 
š 2 
=9 Mls sin 26, 
ii Aaa ( 10-4 ) (oan 


equal to the decoherence rate. In reality, the neutrino state is half of the time active and half of 
the time sterile. Sterile neutrino does not scatter in plasma, so the decoherence rate is twice lower. 
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We note that our estimate for sterile-to-active ratio (7.27) coincides with the ratio 
of the rate of sterile neutrino production to the expansion rate entering (7.25) at 
temperature T = T, when this ratio is at maximum. This implies that sterile neu- 
trinos are never in thermal equilibrium with plasma, provided that the ratio (7.27) 
is much smaller than 1. 

The number of sterile neutrinos in comoving volume remains nearly constant 
afterwards, and so does the ratiot ny, /n,,. Making use of the expression (7.8) for 
the number density of active neutrinos, we find from (7.27) the estimate for the 
present contribution of sterile neutrinos into energy density, 


o, ~0.2. (82a) (= i (7.28) 
gS 10-4 1 keV/ ` f 


Accurate calculations show that this estimate is reasonably precise. Thus, sterile 


neutrino of mass m, = 1 keV and small mixing angle 0, < 1074 could serve as dark 
matter candidate. At m, ~ 1—10 keV this would be warm dark matter, since the 
sterile neutrino momenta at production coincide with momenta of active neutrinos, 
and energy of sterile neutrinos is of order T until the temperature drops below ms. 
Sterile neutrinos of masses ms <1 keV are too hot to be dark matter particles (see 
discussion in Sec. 9.1). 


To quantify whether dark matter is cold, warm or hot, one calculates the distribution 
function of dark matter particles, in our case fs(p,t). Neglecting rare processes of sterile 
neutrino conversion into active neutrino, we write the Boltzmann equation [100] 


d 
= f a’ Tasis 

where fa = fa(p, t) is the distribution function of active neutrinos Va, i.e., the Fermi—Dirac 
function (5.4) with zero chemical potential, and FPa—s is given by (7.22). We note that the 
total time derivative accounts, in particular, for the the time-dependence of momentum, 


p x 1/a, so the Boltzmann equation is written as follows: 


ð ð 


=i fs—Hp Dp f= 1 Pa sin? 202" fa(t, p). (7.29) 


4 
Since particles are relativistic and plasma is isotropic, one makes use of energy E = |p| as 
an argument of f instead of momentum p. It is again convenient to use the temperature 
of cosmic medium T istead of time t. Making use of (3.34) we write 


Ofs Of. s 1 - 2 5gmat 
-|T E H = -Ta 20° fall, E): i 
( ƏT + = qi sin fal ) (7.30) 
We now introduce the dimensionless quantity y = E/T instead of EF and note that the 
equilibrium distribution fa depends precisely on y, while the expression in parenthesis in 
Eq. (7.30) is the partial derivative of fs over temperature at fixed y. This enables us to 


4For the same reason as in the case of photons in Sec. 7.2, the number of active neutrinos in 
comoving volume increases by a factor of order 1 as the effective number of relativistic degrees of 
freedom decreases until the epoch of neutrino decoupling. This does not significantly modify our 
order-of-magnitude estimate. 
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integrate the Boltzmann equation and find explicitly the distribution function of sterile 
neutrinos. As an example, for 7 neutrino transitions we get, making use of (7.23), 
co y T4 dT’ 


T A(T"): (1+ Bsin? 22w G2T'y2)* 


(7.31) 


At finite T this gives a non-trivial distribution function. Note, however, that the final 
distribution function has the same shape as f;(E/T). Indeed, since H « T?, we introduce 
a new integration variable z = y T? in (7.31) and set the lower limit of integration equal 
to zero. Then the integral becomes a dimensionful constant, and hence the final function 
is 


fs(y,T) = fs(\p|/T) = const - f-(|p|/T). (7.32) 


Problem 7.5. Find the value of constant in (7.32). Using this value, refine the estimate 
(7.28). 


This is an example of a situation in which an out-of-equilibrium component has equilibrium 
shape of the energy spectrum. The only consquence of inequilibrium is that the constant 
in (7.32) is much smaller than in equilibrium. The shape of the distribution function 
immediately gives the average momentum of sterile neutrinos, which coincides with (5.17): 


(ps) = 3.15 T. (7.33) 


We see that all bounds on dark matter whose particles decouple being relativistic 
are valid for sterile neutrinos which have never been in thermal equilibrium. The 
strongest bound comes from the analysis of the evolution of phase space density. This 
is discussed in the accompanying book. For sterile neutrinos with the distribution 
function (7.32) the bound is [135] 


ms > 5.7 keV. (7.34) 


The sterile neutrino is, generally speaking, unstable due to mixing with active 
neutrinos. The main channel is the decay into three active neutrinos. The require- 
ment that the lifetime is larger than the present age of the Universe gives the limit 
on the parameters of the model [101]: 


50 keV \° 
6 < 16:1077 ( : ) l (7.35) 


Problem 7.6. Derive the limit (7.35). 


Sterile neutrino decays also into active neutrino and photon; this occurs at one loop. 
This process leads to much stronger bound on the parameters: the model should not 
contradict measurements of the flux of cosmic X-rays and higher energy photons. 
This bound is about 6- 10° times stronger [106] than (7.35) in the sterile neutrino 
mass range 1-10 keV. It is shown in Fig. 7.2. 

This bound is actually so strong that, taken together with the limit (7.34), it 
disfavors the non-resonant production of sterile neutrino dark matter discussed so 
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Fig. 7.2. Bounds on the parameters of sterile neutrinos (mass Mı and mixing angle 0) from the 
data of X-ray telescopes, together with predictions for sterile neutrinos as dark matter particles, 
Qs = Qpm. The thick line corresponds to the production in non-resonant oscillations (cf. (7.28)); in 
this case the bound (7.34) applies. The dashed lines correspond to the resonance mechanism, num- 
bers in units 10~° are the values of the present electron neutrino asymmetry (nv, — nv, )/s [105]. 


far [135]. Also, sterile neutrinos obeying this bound make very small contribution 
to active neutrino masses via see-saw mechanism (see Sec. C.4 for details), 022m, << 
Msol, 1.e., if light sterile neutrinos are dark matter particles then they are irrelevant 
for the generation of active neutrino masses. 

We emphasize that the results discussed here are valid for long lived sterile 
neutrinos only. Note also that even though we studied sterile neutrino mixing with 
one active neutrino species, adding two other active neutrinos does not grossly 
modify the results. 

Let us point out that there are more complicated mechanisms of sterile neu- 
trino production in the early Universe. In this sense our estimate (7.28) is, generally 
speaking, a lower bound. As an example of a realistic, albeit somewhat exotic mecha- 
nism, let us consider sterile neutrino production in oscillations in lepton-asymmetric 
medium [107]. It follows from Big Bang Nucleosynthesis theory and observations 
that the lepton asymmetry at T ~ 1 MeV must be below one per cent. On the other 
hand, under assumption that the maximum temperature in the Universe was higher 
than 100 GeV, sphaleron processes equilibrate the baryon and lepton asymmetries 
at that time (see Sec. 11.2.1), so that the lepton asymmetry at T ~ 100 GeV is of 
order np ~ 107°. This is insufficient for production of sterile neutrino dark matter, 
as we will see soon. Thus, the mechansim we are about to discuss is indeed exotic, 
as it requires the production of lepton asymmtery at T < 100 GeV. 
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In analogy to the baryon-to-entropy ratio, let us introduce the asymmetries 
Ap =, AS Au = 
s S 8 
where a = e, u, T and q; denotes the ith quark flavor. Due to neutrino oscillations, 
one has A,, = As, = A,,. Individual lepton asymmetries Az, = Av, + Aa 
are not conserved, again due to neutrino oscillations. What is conserved at T < 
100 GeV is the total lepton asymmetry Az = >>, Aza. Other conserved numbers 
are baryon number, i.e., Ag = >>, Ag, /3, and electric charge. Neglecting tiny baryon 
asymmetry Ag ~ 107° and recalling that cosmic medium is elecrically neutral, 
one sets the latter two numbers equal to zero. This does not forbid non-vanishing 
individual asymmetris in charged lepton and quark sectors, Aa and Ag,, provided 
that charged leptons œ and quarks q; are relativistic at temperature considered, 
Ma, Mq; ST (otherwise they annihilate out). In fact, asymmetries in the charged 
lepton sector are natural at non-zero Az, since charged leptons carry lepton number. 
Asymmetries in quark sector are then required for the overall electric neutrality. 
Asymmetries A,,, Ag and Aj, are proportional to each other (and to the total 
lepton asymmtery Az), with proportionality coefficients depending on temperature: 
the lower the temperature, the more species disappear, as their masses exceed T. 
Needless to say, quark (or rather hadron) asymmetries, and, by elecric neutrality, 
charged lepton asymmetries, vanish well below the QCD transition temperature 
Tocp ~ 200 MeV, when quarks and gluons are confined in hadrons which annihilate 
out because their masses are larger than temperature. 


As an example, let us find the relations between the lepton and quark asymmetries slightly 
before the QCD transition, T > Tocp ~ 200 MeV, but T < me,m-. In this case, rela- 
tivistic are three neutrino species, electron, muon and u-, d-, s-quarks. We introduce the 
chemical potentials for lepton number wz, baryon number upg and electric charge wa and 
write the chemical potentials (5.3) of relativistic particles in plasma: 


= eee 
Hva = Ub, Mu = 3HB + Sha, 
1 1 
He = My = HL- HQ, Hd = Me = ghB— GHQ- 


We then make use of Eq. (5.22) to obtain the asymmetries 


ga T. ii 2 
Ky =i Ky ial S = 3 
a = GME 623 zz $ Zua) (7.36a) 
Das a Aene a (7.36b) 
e =A =z 2- g), Aa=As= = UB — sha), 


where we recall that charged leptons and quarks have two spin states and quarks exist in 
three colors. Electric charge and baryon number densities are then proportional to 


Qu —Ac Aut Zu za ahs Bact (Aut Aa tA). 


These should vanish, so we have two relations between the three chemical potentials pz, 
LQ, B. Explicitly, the solution to Q = B = 0 is we = wit /2, we = 0, and the relations 


Introduction to the Theory of the Early Universe Downloaded from www.worldscientific.com 


by UNIVERSITY OF NEWCASTLE on 07/07/17. For personal use only. 


7.8. Sterile Neutrinos as Dark Matter Candidates 173 


(7.36) give finally 
Ae = Ap = Ana, Aa=As=—Ar,, Au = 2A. (7.37) 
Interactions of active neutrinos with matter give rise to additional contribution 


to the interaction matrix (7.18), which at the relevant temperatures T < 1 GeV 
reads [103] 


Voa = V2 GF |2 (nua (T) — noa (T)) + S os (T) — nos (T)) 
B#a 


+ (5 + 2sin? bay (7) = nalT)) + (-5 cual ow) 


x Xo (na(T) — na(T)) + E + ; sin? or) (nq(T) — na(T)) |- 


q=d,s 


These are obtained in complete analogy to the calculation of the potential due to 
electron asymmetry in the Sun, Eq. (C.28). One makes use of the Fermi theory (see 
Appendix B.5), and finds that the potential term equals 


(V) = —(Le) — (En), 


where the thermal averaging over all fermionic components is done with the help of 
relations 


FPA =n, (fof) =0. 
The latter relation implies the absence of macroscopic currents in the plasma. 


Problem 7.7. Obtain the explicit form of the neutrino effective potential Vag at 
temperatures above the QCD phase transition. 


Taking the electron—neutrino asymmetry A,, as a reference parameter, we define 


2/20? 
gGr T’. 2Ay, + 5° Ayt.. ee 


4 
Bza a 


o 22T? 


where the numerical factor Aae for each neutrino flavor œ depends on temperature 
for the reason explained above. In particular, Aee = 7 — 2sin? 0w ~ 6.5 when 
c-quark is still relativistic, and Aee = 6 at lower temperatures but above the QCD 
transition. 
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Now, the oscillation probability formula (7.19) shows that there are resonant 
transitions when 


Vaat? = 1. (7.38) 


At that time mixing becomes large, sin? 26™** = 1 (we assume small vacuum mixing 
angle). The transitions into sterile neutrinos occur most rapidly at the resonant 
temperature, which is found from (7.38): 


ms \1/2 (10-4\*/4 / 160 \1/4 
Tres = 5 ( z ) a -1/4 M ; à 
50 TkeV (= ) (=) y eV (7.39) 
where 
E 
y= T 


The simplest possibility which is realized at fairly large vacuum mixing angles 
is that active neutrinos of resonance energies get rapidly converted into sterile neu- 
trinos. On the other hand, antineutrinos do not experience the resonant conversion 
because of the opposite sign of Va. As active neutrinos, but not their antineutrinos, 
get converted, lepton asymmetry decreases and eventually becomes so small that 
the resonant transitions terminate. This happens when the total lepton asymme- 
try A; becomes considerably smaller than the initial one, A; ;: almost all initial 
lepton asymmetry leaks into the sterile neutrino sector. Hence, after the resonant 
transitions, the number of sterile neutrinos per unit entropy is of the order of the 
initial lepton asymmetry, ns/s ~ Az. This gives 


ke AL, isoMs | 
Pe 
Numerically 
Ms Ar i 
Q, ~ 0.2: ( ) ls A 
3 keV (34) (ren 


This corresponds to nearly vertical parts of dashed lines in Fig. 7.2: the present 
mass density of sterile neutrinos is almost independent of the mixing angle ĝa. Note 
that the initial lepton asymmetry Az, is by a factor of 3 to 5 greater than the 
initial electron neutrino asymmetry; see (7.37). Note also that the numbers near 
the dashed lines are the values of the present electron—neutrino asymmetry, which, 
depending on the mixing angle, may be substantially (by a factor of several) smaller 
than the initial asymmetry because of the wash-out process resulting in the sterile 
neutrino production. 

It is worth noting that in this scenario, the average momentum of sterile neu- 
trinos produced by the resonant mechanism is lower than that in the case of non- 
resonant oscillations (7.33). The resonance conditions are reached earlier (at higher 
temperature) in the low energy part of the spectrum, see (7.39); so, the production 
of sterile neutrinos begins at low y = E/T and ends when the lepton asymmetry 
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gets washed out. Thus, the maximum energy of sterile neutrinos is estimated as 
follows: 


Ymar 7 I2 
| falu) Amy dy ~ Fe rAr . (7.41) 
0 5 


For realistic values of the initial asymmetry Az < 1073 this gives ymax = Emax/T < 
0.7. 

For small vacuum mixing angles the above simple picture is incorrect. Only a 
small fraction of the initial lepton asymmetry is reprocessed into sterile neutrinos, 
so the sterile neutrino mass must be larger at smaller 0. Nevertheless, the resonant 
mechanism is realistic in a large range of vacuum mixing angles and masses. At 
small vacuum mixing angles there are three relevant time scales: oscillation time 
in vacuo ¢?2°, oscillation time at resonance t®®t(Tpes) = £2%°/ sin 20, and neutrino 
mean free time Ty. They obey 


(OO Tie <p Se. (7.42) 


The Boltmann equation written for neutrino of thermal energy E ~ 3T has the 
form (7.25) with 


(P(vq > vs)) = sin? (m) (EY si 26. 


2emat(T) pac 
x 2 
= (=) sin? 20, ab mL eT) (7.43) 
and 
1 {mat T 2 
(Plva > Ys) = 5 (=) sin? 20a at r, SHOT), (7.44) 


where we make use of (7.19a) and (7.19c) (see also (7.21)). Note that Eq. (7.43) is 
valid away from the resonance. As a function of t**(T’), this conversion probability 
increases as long as t™**(T) < T, and stays constant at t™*"(T) >> 7,. Hence, the 
conversion is efficient only at ¢@**(T) > 7,, i.e., near the resonance.” The overall 
number of sterile neutrinos is obtained from (7.25) and (7.43): 


Ns Ty Ty > AT 
— ~x — 204 =—, 7.45 
ss IHE T (e 
where AT is such that 
aT EAT) Tps (7.46) 


5In fact, the process we consider is closer to scattering rather than to oscillations, as the weak 
scattering rate exceeds the effective oscillation rate. 
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and all other quantities are evaluated at T = T,-;. To estimate AT, we make use 
of (7.19b) and (7.38) and write 


a 


Vac =l 
T + AT) = ee [i ) An) 


dlogT Tres 
Both Vya and t?*° have power-law dependence on T, so the relation (7.46) gives 


AT poae 


a 


Treg Ty 
In this way we obtain 


Ns ny, | sin? 26 


— n 


s s Hte 4 ’ 


(7.47) 


where the quantities in the right-hand side are calculated at the resonance tempera- 
ture (7.39). The numerical estimate of 2, is fairly complicated, since the parameters 
gx and Age in (7.39) and in H = T?.,/M%, depend on temperature; its simplified 


res 


sin220\ ; ms \3/2( An NP“ 
a, = 02: (FEF) (5) (as) ce) 


Numerical predictions made in the case of electron neutrino mixing with sterile 


form is 


neutrino are shown in Fig. 7.2. As we already pointed out, nearly vertical parts 
of dashed lines there correspond to the case (7.40). (Qs is nearly independent of 
sin 20.) To the left of these parts the resonant mechanism is inefficient because of 
too small lepton asymmetry and too small sterile neutrino mass, while to the right 
of these parts one needs heavier sterile neutrinos for lower sin 20. In this region 
our simple analytical approximation works reasonably well. In the latter case the 
sterile neutrino spectrum strongly depends on Az; in general it is shifted towards 
lower energies [105], in some similarity with the simplest case of Eq. (7.41). This 
modifies the cosmological bounds on sterile neutrino mass: sterile neutrinos are 
not as warm as in the non-resonant case. So, in the case of resonant production, 
the sterile neutrino mass may be considerably smaller than in the non-resonant 
production model. 


In the case of fairly light sterile neutrino, ms < 1 keV, there exists another relevant time 


scale over and beyond the scales listed in (7.42). This is the time in which the resonance 
is traversed due to the expansion of the Universe. To this end, let us discard the active 
neutrino scattering for the time being and consider the effective Hamiltonian 


H= ( a a) , (7.49) 
—W12 W2 
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where the entries are read off from (7.18): 


2 2 
ms o ms. 
wy = Vaca, wy = JE’ W12 = TE sin 204. (7.50) 


(We neglect terms of order sin? 28a; these can be straighforwardly taken care of.) In our 
case the off-diagonal matrix element w12 is small. All entries in (7.49) depend on time, so 
the system spends finite time in the resonance region. This sets the new time scale. To 
estimate it, let us consider the off-diagonal part of the Hamiltonian as perturbation and 
write H = Ho + Hi, where 


0 0 
mael E: par 
0 we —Ww12 0 


Importantly, |wı —w2] is large at almost all times except for a short time near the resonance, 
while at the resonance one has wı = w2. We are interested in the probability of transition 
1 — 2. To the first order in perturbation theory, the relevant solution to the propagation 
equation 


id = Hw (7.51) 


is 


y= ewido wile! at! (;) cs C(t) N wa (t')dt' (2) , (7.52) 
where C(—oo) = 0 and C(+o0c) is the transition amplitude. Equation for C(t) is obtained 
by plugging (7.52) into (7.51): 

C= —~iwie et So lene) wale at" 
which gives 


+020 
Gtr) = -i f dt wig ent SEa wta 


—co 
The integrand here rapidly oscillates at almost all times except for the resonance region, 
hence the integral is saturated near the resonance. In the resonance region we approximate 
wW12 = wi2(tres), wi(t) — we(t) = (w1 — W2)(tres) : (t — tres), set tres = 0 (Otherwise we 
would have to keep track of an irrelevant phase factor in C.) we obtain finally.® 

+oco 


1/2 27 


Eri A 2 
-iljo —wo]t 
dt e iglor=ozlt" — —i W12 


C (+00) = —iw12 f 


—oo 


(7.53) 


[ùi — ù] 
where all quantities are evaluated at the resonance and we recall that (w1 — w2) < 0 in our 


case. The probability of transition 1 —> 2 (i.e., Va — Vs) is 


2 
2TwÍ2 


P(va > Vs) = P(1 > 2) = |C(+00)|? = 


[ùi — we| 

This result is valid provided that w? << |w1 — we|. In fact, it is a particular case of the 
Landau—Zener formula; see Refs. [108-110] for its application to neutrinos. The formula 
6 The integral is implicitly regularized by considering w12 decaying at large |t|, e.g., like w12 œ% ent ; 
where the regularization parameter e€ is set equal to zero after the calculation. 
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gives the transition probability for small w12 but arbitrary wis /|wi — we: 
P(i>2)=1-e°7"", 


where 
2 


w12 


a (tres). 


[r — wal 
The point of the above calculation is the estimate of the effective time scale of the traversal 
through the resonance. This time scale is read off from (7.53): the integral is saturated at 


1 
ttrans ~ A. 
v lùi — ùl 
In the cosmological setting we have 


d(w1 — we) d(w1 — w2) 
EE = SE A = Aon A : 
dt dlog T ie (629) 


(since wı x T? while w2 x T~'). For neutrinos of thermal energy E ~ T this gives 


i 1 Mz, 
trans ™ ea : 
wo H m2Tres 


For ms < 1 keV this scale is shorter than the active neutrino free time 7,, so the resonant 
transition occurs between the collisions of active neutrinos, i.e., in free flight. In this situ- 
ation the formulas (7.43) and (7.44) do not work, and the transition probability is given 
by (7.54). 

Curiously, in the latter case the parametric dependence of the sterile neutrino abun- 
dance is the same as in (7.47). Indeed, we recall (7.50), make use of (7.54), (7.55) and 


obtain (modulo factor of order 1) 


2 
Ns Nv, 2TWi2 Nya W2 . 2 

~ —— nmw — sin^ 20a, 
s s |w1 — ù| s H 


which is parametrically the same as (7.47) given that w2 = (t2"°)~+. So, the effect we have 
studied does not grossly modify the estimate (7.48). 


Chapter 8 


Big Bang Nucleosynthesis 


The earliest epoch in the hot Universe which has been tested observationally is the 
Big Bang Nucleosynthesis epoch (BBN). As we will see, it begins at temperature 
of about 1 MeV and lasts until the temperature drops to a few dozen keV. At this 
time, neutrons in cosmic medium combine with protons into light nuclei, mostly 
helium-4 (tHe = a-particle) with small but measurable admixture of deuterium 
(D=7H), helium-3 (He) and lithium-7 ("Li). The main thermonuclear reactions of 
BBN are listed in the beginning of Sec. 8.3. We will see that at relevant temperatures 
neutrons are less abundant than protons; “extra” protons remain free and in the 
end form hydrogen atoms. 

Measurements of the primordial chemical composition not only confirm the the- 
ory of the hot Universe, but also provide the determination of an important cos- 
mological parameter, baryon-to-photon ratio 7g. Furthermore, BBN constraints 
models pretending to extend the Standard Model of particle physics. 

Precise calculation of light element abundances produced at the BBN epoch is 
a complicated task. An appropriate tool here is the numerical analysis of kinetic 
equations with account of numerous thermonuclear reactions. See, e.g., Ref. [115]. 
In this Chapter, like in many other Chapters of this book, we limit ourselves by 
order-of-magnitude estimates having in mind our main purpose: to discuss physics 
of processes in the early Universe and explain, at qualitative level, the dependence 
of the results on cosmological parameters. 


8.1. Neutron Freeze-Out. Neutron-Proton Ratio 


The first stage of BBN is neutron freeze-out. We will see in a moment that it occurs 
at a temperature of about 1 MeV, which is still too high for light nuclei production. 

In the early Universe, neutrons are produced and destroyed in weak interaction 
processes 


pean + re, (8.1) 
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and crossing processes. The energies relevant here are the mass difference between 
neutron and proton, 


Am = Mn — Mp = 1.3 MeV 


and electron mass Mme = 0.5 MeV. Let us assume for simplicity that the temperature 
is high enough, 


T = Am, me. (8.2) 


Then, like in Chapter 7, the mean free time of a neutron with respect to the process 
(8.1) can be estimated on dimensional grounds, 


m=r n In=CnGel’, (8.3) 


where Cn is a constant of order 1. The processes (8.1) and crossing processes 
terminate when the time Tn becomes of the order of the Hubble time, i.e., 


T? 
T ~w = i $ 
a(T) ~ H(T) = ape (8.4) 
As before, 
M 
1.66,/g« 
and the effective number of relativistic degrees of freedom is 
7 
g =2+ F-44202, (8.6) 


The first and the second terms here are due to photons and electrons /positrons, 
while the third term comes from neutrinos. We temporarily denote the number of 
neutrino species by N,, the actual value is N, = 3. Let us recall here (see Chapter 7) 
that at temperatures T > me, i.e., before electron—positron annihilation, neutrinos 
have the same temperature as photons. 
Equations (8.3) and (8.4) determine the neutron freeze-out temperature, 
1 


The constant Cn entering (8.3) is known: the process (8.1) originates from the 
four-fermion vertex shown in Fig. 8.1(a); the same vertex describes neutron decay, 
Fig. 8.1(b). Hence, the constant Cn is determined from neutron lifetime; numerically, 
Cn = 1.2. Thus, freeze-out temperature (8.7) does not contain unknown parameters. 
It depends, however, on the number of light neutrino species; see (8.5) and (8.6). 
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(a) (b) 4 
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Fig. 8.1. Feynman diagrams for processes (a) n + ve > p + e and (b) n > peDe. 


Recalling that the Fermi constant is Gr = 1.17 - 1075 GeV (see Sec. B.5) and 
gx = 43/4 for N, = 3, we find numerically 


Tn © 1.4 MeV. (8.8) 


It is worth noting that our assumption (8.2) is not well justified, so a more accurate 
calculation is needed. We quote the result below. 

It is amazing that neutron freeze-out temperature and the mass difference Am 
almost coincide. This property would not hold if the masses of u- and d-quarks, or 
the Fermi constant, or Newton’s gravity constant were different from what they are. 
This is one of the coincidences which makes the chemical composition of primordial 
plasma interesting at all. Due to this coincidence, the neutron abundance at freeze- 
out is rather large, so the abundances of light nuclei are considerable in the end: 
if it turned out that Am > Thn, then the neutron abundance at freeze-out would 
be exponentially small, nn x exp(—Am/Tn); see (8.13). On the other hand, for 
Am «& Tn neutrons and protons would be equally abundant at freeze-out, so all of 
them would end up in He nuclei and the primordial plasma would lack hydrogen — 
such a hydrogen-free Universe would hardly be habitable. 

Let us estimate the neutron abundance after freeze-out. With reasonable accu- 
racy, it is equal to the neutron abundance just before freeze-out. It is useful for what 
follows to write the general formula for the number density of particles of type A 
(protons, neutrons, light nuclei) in chemical equilibrium at temperature T << ma 
(see Chapter 5): 


pN 3/2 
NA=9A (= ) e(Ha—ma)/T (8.9) 


where ua is the chemical potential for particle A. We now apply this formula for 
protons and neutrons and make use of the fact that reaction (8.1) is in equilibrium 
just before freeze-out, hence up + He = Un + Hv, ie., 


Ln = Hp + He — Hv. (8.10) 
Electrons and neutrinos are relativistic, and we have from (5.22) 
Nne- — Net ~ HeT’, (8.11) 


so that 
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The difference between number densities of electrons and positrons is equal to the 
number density of protons by electric neutrality, 


Ne- — Net = Np, 


while np/T? is of the order of the baryon-to-photon ratio, np/T? ~ ns ~ 107°. We 
conclude that the chemical potential for electrons is negligibly small, 


He =9 
= ~ 10. 
T 
Let us assume here that lepton asymmetry of the Universe is small (we will discuss 
the opposite situation in the end of this Section), i.e., 
ny — No K Ny + no ~ T”. 
Then p,/T is also negligible, and we obtain from (8.10) that 
lin = Hp: (8.12) 
It then follows from (8.9) that neutron-proton ratio at freeze-out is 


Nn L 9—(mn—mp)/Tn 


np 


aig Ate, (8.13) 


(We used the fact that both proton and neutron have two spin states, and set 
Mn = Mp in the pre-exponential factors.) 

Neutron—proton ratio (8.13) is roughly of order 1, so one needs to calculate the 
freeze-out neutron abundance with good precision. The result is [112, 113] 


— = 0.18, (8.14) 


which corresponds to the effective freeze-out temperature Ta ~ 0.75 MeV, cf. (8.8). 
Note that the neutron—proton ratio depends on the number of light neutrino species 
N,. More generally, the dependence is on the effective number of relativistic degrees 
of freedom in the plasma at T ~ 1 MeV. On the other hand, there is practically no 
dependence on other cosmological parameters. 

Let us calculate the age of the Universe at neutron freeze-out. According to 
(3.29) we have 

1 _ Mñ 
2H(T,) 272° 
For Tna = 0.75 MeV and N, = 3 we get numerically 


t= 


= LiL ss 


Thus, BBN theory deals with the cosmological epoch beginning at one second after 
the Big Bang. 

To conclude this Section, we recall that the above calculation has been done 
under the assumption of negligible lepton asymmetry. If this assumption is not 
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valid, then instead of (8.12) we have un = Hp — Hv., and instead of (8.13) we obtain 


Nn Am Hv 
— =e — — —= ]. 8.15 
m — exp (5 - he) (8.15) 
The neutron—proton ratio in the end determines the *He abundance (see Sec. 8.2), 
Nn 
Nao X —. 
Np 


By comparing BBN theory predictions with observations, one finds that the devia- 
tion of the ratio nn/np from the standard value should not be large, 


|A (nn /Mp)| 


Nn /Np 


< 0.025. 


This leads to the bound on chemical potential for the electron neutrino, 


| Bite 
T 


< 0.025. 


A more accurate bound is given in (7.16). 

Big Bang Nucleosynthesis is also sensitive to the effective number of relativistic 
degrees of freedom in the cosmic plasma at T ~ 1 MeV. This number is traditionally 
parametrized in terms of the number of neutrino species Nes. In the Standard 
Model Neff = Nv = 3, but Nerf may take a different value in extensions of the 
Standard Model. One of the reasons! for the dependence of helium abundance on 
Nerf is that the neutron freeze out temperature (8.7) depends on gẹ (see Eq. (8.5)): 

Ta x (1/M%,)"/3 x gt’. 
Hence, the deviation of n,/n, from the Standard Model value (8.13) is, to the 
leading order in Ag,., 
A(nn/np) _AmAT, Am Ag, 
Halts) Te Ta Te Oe 
Now, from Eq. (8.6) we find 


Ags _ (7/8)-2ANes¢ 7 
SM = gANet 


gx g 
and obtain finally 
Bilt) T Atang fa) 
Requiring this to be less than 0.025, we get a limit 
|[ANeff| S 0.5, (8.17) 
which estimates the sensitivity of BBN to Neff. 


l Another reason, which we ignore here, is that the time of the creation of light nuclei tyg also 
depends on gx; see Problem 8.2. 
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Fig. 8.2. BBN data allow the range in the parameter plane (Neff, Hve/T) inside the inclined 
strip [118]) (68% C.L.). This illustrates the degeneracy in these parameters. The degeneracy is 
lifted by CMB data (vertical strip, 68% C.L.), under an assumption that neither lepton asymmetry 
nor abundance of new relativistic particles change during the evolution from the BBN epoch to 
recombination. 


There is, however, a degeneracy between Nepp and the chemical potential of the 
electron neutrino, as they have the same effect on n,,/n»,. According to Eqs. (8.15) 
and (8.16), the degeneracy is along the lines 


_ Hve + 7 Am Hve 


E Npn 0.05A Nf = const. 
T æn “f T sama 


So, if there are both hypothetical light particles and lepton asymmetry in cosmic 
plasma, their effects on the helium abundance can cancel out. This is illustrated in 
Fig. 8.2. The degeneracy is lifted by invoking other cosmological data, notably, on 
CMB.? 


8.2. Beginning of Nucleosynthesis. Direction of Nuclear Reactions. 
Primordial *He 


The chain of thermonuclear reactions in the early Universe begins with deuterium 
production in the reaction 


p+tn— D+y%. (8.18) 
This reaction is denoted in nuclear physics as 


p(n,7)D. (8.19) 


2To be accurate, the degeneracy remains there to some extent. The nonzero chemical poten- 
tial contributes also to the sum of neutrino and antineutrino number densities and thus mimics 
ANe fr = (90/7?) (uw /T)?, as follows from integrating the Fermi—Dirac destribution (5.4). So, 
the presence at the recombination epoch of a tiny number of new relativistic particles is indistin- 
guishable from the presence of a small lepton asymmetry. 
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Other nuclear reactions are denoted in a similar way. To calculate the temperature 
at which this reaction begins, we use the following approach. We assume that the 
reaction (8.18) is fast, so there is chemical equilibrium between deuterium, pro- 
tons and neutrons. We also switch off other thermonuclear reactions. Under these 
assumptions we calculate equilibrium abundance of deuterium at temperature T. 
This abundance is small at high temperatures, and we conclude that deuterium 
is not produced (cf. Sec. 6.1). The physics behind this phenomenon is that the 
deuterium nuclei, that are created in neutron—proton collisions, dissociate very fast 
back into neutrons and protons by absorbing hard photons from the tail of thermal 
distribution. The deuterium production begins when its equilibrium abundance cal- 
culated under above assumptions becomes comparable to abundances of neutrons 
and protons (the latter are of the same order of magnitude, see (8.14)). 

This “equilibrium” approach enables us to determine the direction of thermonu- 
clear reactions. Whether or not these reactions are actually fast depends on their 
cross sections and the cosmological expansion rate. In fact, the latter is rather high, 
and chemical equilibrium is not reached. Because of the latter property, the relic 
abundances of D, *He, “Li are not negligibly small: in chemical equilibrium, all 
neutrons would end up in He, the most tightly-bound light nucleus. 

Continuing with the equilibrium approach, let us write (8.9) as the Saha equa- 
tion. Because of thermonuclear reactions, chemical potentials of neutrons and pro- 
tons are different. The formula (8.9) gives for neutrons and protons 


3/2 
Nn =2 (=) e(Hn—mn)/T (8.20) 
20 
T\ 3/2 
Ny = 2 (= ) e(He—™p)/T (8.21) 
20 


where we neglected mass difference of neutron and proton in pre-exponential factors. 
If a nucleus of atomic weight A and charge Z is in chemical equilibrium,’ then its 
chemical potential is 


HA = Hp’ Z + Un: (AZ). 
Indeed, chemical equilibrium means that there is a chain of fast reactions leading 


to production of a nucleus (A, Z) from Z protons and (A — Z) neutrons. 
Proceeding as in Sec. 6.1, we obtain from (8.9), (8.20), (8.21) that 


on \ #A-D 
na = nina 2 gaa ( =) e^a/T, (8.22) 
Mp 


3Hereafter subscript A labels nucleus (A, Z); to simplify formulas, we do not use more accurate 
notations like uA, z. 
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where we set ma = Am, in the pre-exponential factor and introduced the binding 
energy of nucleus (A, Z), 


Aa = Zmp + (A — Z) Mn — Ma. 


Let us also introduce dimensionless ratio of the number of baryons in nuclei (A, Z) 
to the total number of baryons, 
A 
Xa = ZA, 


Np 


Then Eq. (8.22) is written in the form of the Saha equation, 


Qn (A-1) 
Xa = XF XA ng ee (- =) e^a/T, 
P 


The number of baryons is 


2¢(3 
Np =e * Ny = Neg: ale = 0.24n,T?. 


So, we have finally 


2.57 \ #4-) 
Xa = XXI gd (=) ow (8.23) 
p 
Like in Sec. 6.1, the right-hand side of this equation contains a small factor 
nám! (T/m)? ^, so the equilibrium abundance of nuclei becomes sizeable at 
T < Ay only, i.e., when the temperature becomes much smaller than nuclear bind- 
ing energy. 
Nucleosynthesis begins when production of deuterium becomes thermodynam- 
ically favored. We find from Eq. (8.23) that modulo a numerical factor of order 
one 


3 
X 2.57 \ # 
AD a m( ) eP, (8.24) 


where Ap = 2.23 MeV, and A = 2, Z = 1 for deuterium. Let us introduce temper- 
ature Tp at which equilibrium abundances of neutrons and deuterium are equal: 


Xp 
—(Tp) = 1. 2 
XD (tp) (8.25) 
We find from (8.24) and using ns = 6.05- 107° that 

Tp = 65 keV. 


In fact, nucleosynthesis begins somewhat earlier, since the cross sections of relevant 
thermonuclear reactions are fairly large, and deuterium burns already at the time 
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when its abundance is small, Xp/Xn < 1. We discuss this point in Sec. 8.3.2, and 
here we just state that the characteristic temperature is (see also [112, 114]) 


TNs = 75 keV. 


We note that this temperature depends on 7, weakly (logarithmically). 

Even though the temperatures Tp and Tys are numerically close to each other, 
the physical situations at these temperatures are quite different. At T = Tys, 
there are many more neutrons than deuterium nuclei in cosmic medium; duterium, 
once created, rapidly burns into heavier nuclei. On the other hand, the medium at 
T =Tp contains roughly equal number of free neutrons and deuterium nuclei, but 
their abundances are already small, as we will see below. 

In fact, at these temperatures the production of “He is thermodynamically 
favored. Let us see this, making use of the equilibrium approach. If practically 
all neutrons at T ~ Tyg are indeed in *He, then Eq. (8.23) must give Xay. ~ 1, 
while abundances of all other light nuclei, including free neutrons, must be small. 
Let us write Eq. (8.23) for He (A = 4, Z = 2, ga = 4), 


9/2 
25T eae 
Xaye = hee - 83 (=) eMue/T 
There are more protons than neutrons in plasma (see (8.14)), so extra protons give 
Xp ~ 1. Again omitting factors of order 1, we express the neutron abundance in 
terms of Xe, 


Ag fr" 

A= he (=) oe Atte / 22 (8.26) 
Mp 

By substituting (8.26) into (8.23), and taking X4y,. ~ 1 we find for other nuclei, 

modulo factors of order 1, 


$ 
mgA AA-^4ype(4-Z)/2 


e P 


AA-^4ye(4-Z)/2 


~ 1073(4+2-32) er f (8.27) 


where numerical value of the pre-exponential factor is given for nz = 6.05 - 10719, 
T = 75keV. Note that the sign in the exponent depends on binding energy per 
neutron, A,/(A—Z). Among light nuclei, this quantity is the largest for *He, that 
is why *He is predominantly produced in the early Universe. 

Binding energies of relevant stable or sufficiently long lived light nuclei are given 
in Table 8.1. Making use of these data, we find from (8.27) the following estimates 
at T = 75keV: Xn ~ 107°, Xp ~ 10782, Xsy ~ 1071, Xsge ~ 1074, Xei ~ 
10768, Xz; ~ 107101, Xrge ~ 10750, Xap ~ 10764. Thus, at T = Tys equilibrium 
abundances of light nuclei are much smaller than that of tHe. 
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Table 8.1. Binding energies of some stable or long lived nuclei (MeV). 


Z Nucleus Aa Aa/A Aa/(A- Z) 
1 °H=D 2.23 1.11 2.23 
3H=T 8.48 2.83 4.24 
2 3He 7.72 2.57 7.72 
He =a 28.30 7.75 14.15 
3 Shi 31.99 5.33 10.66 
Thi 39.24 5.61 9.81 
4 TBe 37.60 5.37 12.53 
8B 37.73 4.71 12.58 


6 BG 92.2 7.68 15.37 


Let us make two points here. First, applying (8.27) to 12C we would get Xi2¢g > 
1. This means that if carbon could be produced, our assumption about domination of 
tHe would be wrong: almost all neutrons would end up in '*C (and heavier nuclei). 
However, one can see from Table 8.1 that !2C cannot be produced in two-body 
reactions involving lighter stable nuclei*: fusion of two °Li isotopes is very seldom 
because of tiny abundance of this isotope. (This isotope is produced only in the 
fusion of helium-3 and tritium, and this process is strongly suppressed as compared 
to production of helium-4 in collision of the same nuclei.) Elements with A = 5 and 
A = 8 cannot be produced in two-body reactions of abundant nuclei. This is why 
nucleosynthesis chain does not reach carbon in the early Universe. Thermonuclear 
reactions proceed towards production of +He. 

Second, if matter in the Universe were in chemical equilibrium with respect to 
production of *He at T > 75keV, the nucleosynthesis would occur at higher tem- 
peratures. This is because binding energy of tHe is greater than that of deuterium. 
However, “He is produced by deuterium burning, rather than directly from protons 
and neutrons. Hence, it is the production of deuterium that determines the nucle- 
osyuthesis temperature (“deuterium bottleneck”). In this sense the nucleosynthesis 
is delayed in the early Universe. 


Problem 8.1. Find nucleosynthesis temperature in a hypothetical case of fast pro- 
duction of “He directly from protons and neutrons. 


Let us now find the age of the Universe at the epoch of thermonuclear reactions, 
i.e., at Tyg = 75 keV. According to (3.29) we have 

1 — Mp, 
2H(Tws)  2Ths 


tng = (8.28) 


4Element !*C is produced in stars in the processes of creation of short-lived 8B in collision of two 
4He nuclei and subsequent fast absorption of yet another “He. This mechanism does not work in 
the early Universe because of low density of “He nuclei. 
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The expansion rate is determined by photons and neutrinos, and the latter are 
already frozen out. So, we have 


4/3 
Mpi i 7 4 
Mp, = —— h . =24=--2-N,-{ — ; 2 
eager (=) 48:29) 
The age of the Universe at T = 75keV is, therefore (for N, = 3), 
tys © 230s % 4 min. 


Using this estimate we now calculate the primordial abundance of helium-4. We 
will see in Sec. 8.3 that thermonuclear reactions proceed rapidly at T ~ Tyg. After 
BBN, most neutrons are collected in helium-4, so the abundance of helium-4 is half 
of that for neutrons, 


1 
naye(T vs) = atm (Tys). 


The latter, in turn, is related to proton abundance as follows, 


Nn (Tys) nn (Tp )e Tts / 7» 


TE L ee ad, 8.30 
np(Tns) Np(Tn) F tt, (Tn) (1 a e~tns/Tm ) ( ) 


where we modified the relation (8.14) by taking into account neutron lifetime, Tn ~ 
886s. As a result, we obtain the mass fraction of *He, 


M4He ' NaHel Tys) 2 
Xue = -m n (Toe = mle o ~ (8.31) 
Mp[np(Tys) + Nn (Tys )| -n 4 1 


As we discussed in the end of Sec. 8.1, this mass fraction depends on the num- 
ber of relativistic degrees of freedom through the neutron freeze-out temperature 
Tn. We see that there is an extra source of this dependence: the nucleosynthesis 
time tys which enters Eq. (8.30), also depends on gx. This modifies somewhat our 
estimate (8.16), although our crude numerical bound (8.17) remains almost intact. 

Overall, barring large lepton asymmetry, BBN is sensitive to the number of 
relativistic degrees of freedom and to the baryon-to-photon ratio. (The latter is 
particularly relevant for residual deuterim fraction, see Sec. 8.3.2.) The BBN theory 
is consistent with the data in the allowed range of these parameters shown in Fig. 8.3. 
These results constrain extensions of the Standard Model with exotic light particles. 


Problem 8.2. Refine the estimate (8.16) by taking into account that tys x 
H-—!(Tys = 75 keV) depends on gx. 


Problem 8.3. Find the lowest possible freeze-out temperature of hypothetical mass- 
less particles, assuming 20% uncertainty in gx. Do the same for uncertainty of 5%. 


Some uncertainty in the theoretical prediction of primordial helium abundance 
is due to uncertainty in measured neutron lifetime. The latter is relevant for both 
the temperature Tn (see (8.7), (8.13), (8.14)) and the number of neutrons remaining 
in the plasma at time tys (see (8.30)). 
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Fig. 8.3. Allowed ranges of the baryon density and number of relativistic species (68% and 95% 
C.L.) from BBN (larger ellipses) and CMB [119]. Crosses show the best fit values. 


8.3. Kinetics of Nucleosynthesis 


We have seen in the previous Section that nuclear reactions proceed towards produc- 
tion of helium-4. In this Section we discuss the rates of the most relevant reactions 
and estimate the residual abundances of other light elements. 

The direction of nuclear reactions that we found in the previous Section suggests 
for us to divide these reactions into several categories: 


(1) p(n, y)D, production of deuterium, initial stage. 

(2) D(p, y)?He, D(D, n)*He, D(D, p)T, *He(n, p)T, preliminary reactions preparing 
material for *He production. 

(3) T(D, n)*He, *He(D, p)*He, production of +He. 

(4) T(a, y)"Li, *He(a, y)"Be, “Be(n, p)’Li, production of the heaviest elements. 

(5) *Li(p,w)*He, burning of "Li. 


We note that the reaction rates are proportional to abundances of colliding nuclei, 
so among all possible reactions the most relevant are those involving at least one of 
the abundant nuclei, i.e., p, n, D, “He. 

Let us consider these reactions in turn, with the purpose to estimate their rates 
in the early Universe. By comparing these rates with the expansion rate at nucle- 
osynthesis, 


H(Tys = 75keV) = 2. 1073 s7}, 
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we will find the residual inequilibrium abundances. These are of course much higher 
than abundances that would be present in thermal equilibrium. 


8.3.1. Neutron burning, p+n—> D++¥ 


As we have seen, deuterium production becomes thermodynamically favored at 
temperature T = Tys % 75keV. However, the Universe expands rather fast, so 
some neutrons could in principle be not burned out. Let us show that, in fact, 
the fraction of neutrons which are not burned is negligibly small. To this end, we 
compare the rate of neutron burning with the expansion rate at t = tys. 

The cross section of deuterium production can be roughly estimated as the 
geometric cross section, 


a 1 1 546-18 cm? 


(Or) pian) ~ ma ~ 137 (200 MeV)? j 


where m, is the pion mass determining the typical spatial range of nuclear interac- 
tions, r ~ m71, while the fine structure constant œ accounts for suppression related 
to photon emission. Note that this estimate does not depend on the velocity of 
colliding particles, i.e., on temperature. In fact, the temperature dependence exists, 
and the corresponding corrections change the cross section by a factor of 1.5-2 at 
T ~ Tyg. Furthermore, since deuterium is a loosely bound nucleus, there is an 
additional factor w./pp, where wy ~ Ap is the photon energy and pp is the typical 
center-of-mass momentum of neutron and proton in deuterium. The latter can be 
found from the virial theorem, 
2 

ae ~ Ap, (8.32) 
where we assumed for the estimate that the interaction potential between proton 
and neutron is inversely proportional to the distance. The final estimate is 


3 
zg: Sm 


(ov)pin, y) D z6.107 (8.33) 


s 
Neutron burning occurs in collisions of neutrons with protons leading to deu- 
terium production. Its rate per neutron is given by 


¢(3) 


Diiny)D = Np ` (F%)pnyp = NB: 2 n2 T’. (ov) p(n,7)D 
=0.5s 1, for ng =—6.05-10°'°, T=Tys=75keV, (8.34) 


where we expressed the proton number density through baryon-to-photon ratio nz 
and photon number density at T = Tys. Since this rate is much higher than the 
cosmological expansion rate, Iy¢,,7)p >> H(Ts), neutrons indeed burn out, and 


practically all of them combine into deuterium." 


5That would not be the case for np < 107". 


192 Big Bang Nucleosynthesis 


Fig. 8.4. Sketch of the potential between colliding nuclei. 


Problem 8.4. Neglecting neutron decays, estimate the concentration of free neu- 
trons towards the end of BBN, i.e. at T < Tyg, where Tyg ~ 75 keV. 

Hint: At T < Tys deuterium rapidly burns into heavier nuclei (+He in the end), 
which are not destroyed at these temperatures. 


8.3.2. Deuterium burning 


Deuterium is the material from which tritium and helium-3 are produced. The cross 
sections of the reactions 


D+D—%He+n and D+D—T+p (8.35) 


could be estimated as geometric cross sections, but we have to take into account the 
Coulomb barrier: both colliding nuclei carry positive electric charge, so they repel 
each other. This repulsion dominates at long distances, r >> 1/m,, and inhibits the 
reactions. The form of the potential is schematically shown in Fig. 8.4. 

Because of the Coulomb barrier, nuclear reactions occur due to quantum tun- 
neling. To estimate the corresponding suppression factor for nuclei of charges Z1 
and Z2, masses Mı and Mə and velocities vı and v2, let us work in the center- 
of-mass frame. In this frame, the incident kinetic energy is Ekin = Mv? /2, where 
M = MıM2/(Mı + M2) is the reduced mass and v = vı — v2 is the relative velocity. 
(Even though we are currently interested in deuterium burning, it is useful for what 
follows to consider general case.) 

The tunneling amplitude is exponentially suppressed. For s-wave scattering we 
have 


A x exp -f V2M (V(r) — Exin)dr} , 
0 


where the turning point ro is determined by the relation 


1 LZ 
Ekin(r = ro) = Exin(r = 00) — V (ro) = g Me m 2m 
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in writing the exponent we assumed that ro > 1/m,. Thus, the exponent is 


ro fd 1 AVA 
-Vm | pa ene cel 9 
0 r ro v 


As a result, the cross section is suppressed as 0 x exp (—2raZ: Z2/v), and including 
pre-exponential factor we have 


BRO A | anak Zay, (8.36) 


where go is the geometric cross section in the absence of the Coulomb suppression. 
The expression (8.36) is to be averaged with the Maxwell—Boltzmann distribu- 

tion, 

I exp (- Mir = j vdv 


Jo. exp (F) Vdo 


(ov) = 00: 27aZ,Z2 - (8.37) 


The normalization integral in the denominator is straightforwardly calculated; it is 
equal to \/7/2(T/M)?/?. We evaluate the integral in the numerator by the saddle 
point method and obtain 


i. e Mvo? 2raZıZə d 
f Xp oT 7 vdv 


2T E Mv 27aZ1 Zo 
= vo, | = XP | -— — —— ], 
0 i ie ahi P oT Uo 


where the saddle point vo is determined by 


Mvo 27aZy Zə 


=] 2 
T UD 


As a result, (8.37) becomes 


2 MN?’ 3 MN! 
(ov) = oo: y (rozi) (F) e -3 Brozzi (F) |. 


We now introduce convenient quantities: dimensionless reduced mass of incident 
nuclei A = M/m, and temperature in units of billion Kelvin, Tọ = T/(10° K) = 
T/(86 keV). In these notations, the final result is 


(ov) = 9.3 - oo - gay" Aln , e7426: (Z1 Z2)? S AMST Y? (8.38) 


We note that this estimate for (cv) assumes that go is independent of momenta of 
colliding nuclei at energies in the interval Ekin ~ 10—100 keV. This is often not 
the case and such a dependence leads to more cumbersome expressions instead of 
(8.38). In particular, the pre-exponential factor often has different dependence on 
temperature as compared to (8.38). Furthermore, sometimes the expression (8.38) 
is not relevant at all, since the cross section is dominated by intermediate resonance 
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states. Finally, the production of new nuclei may not occur in s-wave scattering; 
in these cases the non-zero angular momentum gives important contribution to the 
effective potential determining the tunneling exponent. This yields contributions to 
the cross section that have different temperature dependence in the exponent as 
compared to (8.38). We omit these “details” here, and will use the correct expres- 
sions in appropriate places. We discuss the calculations of burning rates in more 
detail in the end of Sec. 8.3.4. 

Coming back to deuterium burning reactions D(D,p)T and D(D,n)*He, let us 
roughly estimate do by making use of the typical range of nuclear force, 
16 cm? 


To ~ me? ~ 107% em? ~ 3-107 
S 


As a result we obtain for these reactions (A = Z; = Z2 = 1 for DD initial state) 


3 = 
(ov) pp ~ 3: 19-8 Ty 2/8, QA 426-75 1° (8.39) 


In fact, this is a reasonably good estimate for the combined rate of reactions (8.35) 
at temperatures of interest, including Tg ~ 1. 

Already at the beginning of deuterium burning this process is fast. As an exam- 
ple, at deuterium abundance of 1073 relative to protons, the burning rate per deu- 
terium nucleus is estimated as 
2¢(3) 


3 S 
=, Tyg Ds 0.2 S > 


Tp~ (ov) pp : 10-°np 


which is much greater than the Hubble parameter. As we will see momentarily, it 
is for this reason that the charactristic nucleosynthesis temperature Tys = 75 keV 
is somewhat higher than the temperature Tp = 65 keV introduced in (8.25): deu- 
terium burns when its abundance is relatively small, and free neutrons get bound, 
predominantly in *He, already at higher temperatures. For approximate description 
of deuterium burning, let us introduce the quantity 


NDn = Nn + ND, 


which is the density of neutrons either in free state or in deuterium. We write the 
Boltzmann equation for this quantity, which accounts for deuterium burning and 


cosmological expansion® 


dnpn 
dt 


The right-hand side here descibes irreversible transition of neutrons into °H and 
3He, and then into tHe. Note that we ignore here deuterium burning in reactions 


+ 3Hnpn = —(ov) ppn. (8.40) 


SHere we use the fact that the total number of DD reactions per unit time per unit volume 
is (1/2) - (ov) ppn?,. Indeed, the reaction rate (inverse lifetime) of a deuterium nucleus equals 
(ov) ppnp, but one has to count half of all nuclei. However, each DD reaction kills two deuterium 
nuclei, and this compensates for the factor 1/2. 
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with heavier nuclei, to be discussed in Sec. 8.3.3. This approximation is sufficient 


for our estimates. 


The process p(ny)D and inverse process D(yn)p are fast; see Eq. (8.34). There- 
fore, neutrons and deuterium are in approximate chemical equilibrium with each 
other. We substantiate this claim later on; See Eq. (8.48). In this approximation, 


the abundances obey the Saha equation (8.24), hence 


ND es 
np = (=) -Npn = S(T) -NDn, 
np nn 


where 


3 
ga Xp/Xn . Xp _ "i zr a 
1+Xp/Xn Xn 
It is convenient to study the ratio np/T?, and, more precisely, introduce 


NDn 


YDn = R3 
re a 


where 


ap = 0.75.55 = 0.75np - 2¢(3)/m2 = 1.1- 1071? for ng = 6.05 - 107". 


(8.41) 


(8.42) 


Towards the end of BBN epoch, the quantity a,T? is equal to the number density 
of free protons (factor 0.75 accounts for the fact that a quarter of all baryons get 
bound ito helium), and npn is the final number density of deuterium nuclei. Thus, 


vpn T = |) = (T 0): 


Np 
On the other hand, before the BBN epoch we have 


Zn, =n nl 
"Dn = n = NBT n/n 
and 
Vpn(T > Tys) = vË = 1 _t/™ T 


0.75 1+ mn/np 


where we use the estimate (8.30). 
Equation (8.40) leads to the Boltzmann equation for Vpn: 


dvpn 


rs —(ov) pp ar S(T) Veni 


As usual, let us use temperature instead of time and recall the relation (3.34) to 


write 


dvpn, 


“Ot = (a0) Dp apMhi S(T) vbn- 


(8.43) 
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This gives 
1 1 
— = -=y + Io(T), (8.44) 
VYDn VDn 
where 
Ip(T) = I (ov) pp(T"’) - apM>p, - S°(T') aT’. (8.45) 
T 


As temperature decreases, the integral Ip (T) grows and Vpn decreases, as expected. 
By making use of the formula (8.44), we first estimate the characteristic temper- 
ature at which free neutrons and deuterium burn and most of neutrons get bound in 
3H, 3He and in the end in “He. We define this temperature Tyg as the temperature 
at which half of neutrons remain free or bound in deuterium nuclei, while another 

half are bound in heavier nuclei. This happens when vp,(Tws) = vi) /2, i.e., 
Ip(Tns) = —~ (8.46) 


We will see momentarily that Tys > Tp = 65 keV, so that deuterium is less 
abundant than free neutrons at T = Tys, i.e., Xp K Xn, and therefore 


3 

xX 25T\? ap 

s(t) =P = n (= ) or. 
n p 


At temperatures of interest the parameter (ov) pp is almost independent of tem- 
perature and is given by 


3 
ig 


(ov) pp(T > Tp) = 3.4-107 a (8.47) 


in accordance with the estimate (8.39). The integral [p(Tws) is saturated at lower 
limit of integration. Because of strong (exponential) dependence of $? on temper- 
ature, this integral at T > Tp can be calculated by using the variable T7! and 
integrating only the exponential factor exp(2Ap/T). As a result, we obtain approx- 
imate expression 


. mr 25T\? 20 
I(T > Tp) = (av) p0(T > To) «appx th (= ) er; 
P 


With Mz, = 4- 1018 GeV, we get from (8.46) 
TNs = 75 keV. 


We have used this estimate in our previous analysis. 

As the temperature decreases, the integral Ip(T) rapidly grows, and the relation 
(8.44) tells that the abundances of free neutrons and deuterium rapidly decrease; 
neutrons get bound in heavier nuclei. As a example, at temperature Tp = 65 keV, 
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when abundances of free neutrons and deuterium nuclei are equal to each other, the 
formula (8.44) gives’ vp(Tp) = vn(Tp) = [2Ip(Tp)|7! ~ 4- 1074. 


Let us check that free neutrons and deuterium are indeed in chemical equilibrium with 
each other at Tys = 75 keV. Chemical equilibrium holds if the deuterium creation rate in 
reaction p(ny)D (which in chemical equilibrium is equal to the rate of inverse reaction) is 
higher than the rate of deuterium burning in reactions (8.35): 


dnp dnp 
(=) = (9%) p(nq)D ` Npn > |- | = (ov) pv: np. 
p(ny)D DD 
The ratio of these rates is given by 
p= —rbpo-mb _ _(ov)pp_ Mn (y (8.48) 
(OU)p(ny) D` NpNnn (FV) p(ny)D Np \ Nn 


At T = Tys one has nn/np ~ vÊ? /2 ~ 0.1, while np/nn is given by Eq. (8.24) and its 


numerical value is np/nn = 1.4- 1077. Making use of the cross sections (8.33) and (8.47), 
we get R œ 107? <1, which is the desired result. 


We now use the result (8.44) to estimate the final abundance of deuterium. To 
this end, the lower limit of integration in (8.45) is set to T = 0. To estimate this 
integral, we replace $?(T) in the integrand by step function 6(Tp — T): at T ~ Tp, 
the parameter (cv) pp depends on temperature weakly, while S?(T) rapidly grows 
from zero to one as the temperature decreases and crosses Tp. Thus, the integral is 
estimated as follows: 


Tp 
nr == | (ov) p(T) - apM3,a4T. (8.49) 
0 


The dependence of (cv) pp is fairly important at T < Tp. To take this into account, 
we change the integration variable to y = Ty. 1/3 and write 


cae 
ip(f=0) = 9.7 -10° | 074764 dy for ns = 6.05-107"°, (8.50) 
y 
Yy 


D 


where yp = T = 1.1. This integral is much greater than (v)~!, and according 
to (8.44) the final abundance of deuterium is practically independent of the initial 
neutron abundance. We have for the final deuterium abundance 


D = ypn(T = 0) = 15 (FT =0) (8.51a) 
=2.107” for ns = 6.05- 107". (8.51b) 
This ratio remains constant after the BBN epoch. 


TThe approximation of chemical equilibrium between free neutrons and deuterium is not very 
accurate at T = Tp, so the results for vp(Tp) and vn(Tp) should be considered as order-of- 
magnitude estimates. 
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Since Tp depends on ns weakly (logarithmically), the main dependence of 
the deuterium-to-proton ratio (8.5la) on 7g comes from the parameter ap, see 
Eq. (8.42). It follows from the results (8.44), (8.49) that 


i.e., this fraction is inversely proportional to ns. The observational determination 
of the deuterium-to-proton ratio in the present Universe enables one to find the 
baryon density with good precision. 

Note that the integral in (8.50) is saturated in the interval Ay ~ 0.25, i.e., the 
relevant temperature range extends from Tp = 65 keV to 


-3 
h~ a 4 Ay) ~ 0.4, T~0.4-109 K ~ 35 keV. 


In this temperature range the free neutron abundance is small, nn < np, so that 
Npn = Np, and Eq. (8.40) describes deuterium burning without its production in 
the reaction p(ny)D. Therefore, our assumption of chemical equilibrium between 
deuterium and free neutrons is actually irrelevant for the estimate of the final deu- 
terium abundance. 

It is worth pointing out that an order-of-magnitude estimate of the final deu- 
terium abundance can be obtained in a very simple way. Namely, let us write the 
Boltzmann equation (8.43) at T > Tp (with vp substituted for vpn) as follows: 


dln vp 


TR = (ov) pp aM} Tvn. 


Deuterium burning terminates when the right-hand side of this equation decreases 
and becomes of order 1. This determines the final abundance 


a ~ ((ov) pp apMž T). (8.52) 
By inserting here the typical cross section (8.47) and temperature Tp ~ 65 keV, we 
obtain an estimate which has the same order of magnitude as the result (8.51b). 

Let us now consider one more reaction involving deuterium, D +p — y + 3He. 
Its cross section is much smaller than the cross section of the DD reactions we 
discussed above. The reason is that photon emission leads to the electromagnetic 
suppression. Hence, go ~ 107°?! cm/s and 


3 
- -1/ 
(ov) D(p,7)3He = 8 19 oc 2/3 | .-3.7-Ty A 


? 


where we made use of the general formula (8.38) with A = 2/3, Zı = Z = 1. The 
rate of deuterium burning via this channel is proportional to proton abundance, 


T = np: (OV) D(p,7)3He- 
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For ng = 6.05- 1071? and T < Tys this rate is well below the expansion rate. This 
reaction would be important at large ns: the larger is the number of baryons® the 
more deuterium is burned out. 


8.3.3. *Primordial ?He and 3 H 


Helium-3 and tritium produced in collisions of deuterium nuclei, then burn into 
helium-4. The simple estimate (8.38) does not work for helium-3 burning reaction 
3He + D — p + *He. (See discussion in Sec. 8.3.2 and in the end of Sec. 8.3.4). 
Instead, the reaction rate in the energy range of interest is well described by the 
formula 


3 
—15 CM —1/2_— m 
(0) 211e(D.p)4He = (90)¢Hep = 1075 —— -Tg YeI, (8.58) 
We give here an elementary analysis similar to that leading to the esti- 
mate (8.52). Like in previous Section, let us write the Boltzmann equation 


one = apMp,T ((ov}onen VD ` V3He — 5(ov)pp vs) i (8.54) 
where the first term in parenthesis describes burning of ?He, while the second term is 
due to ?He production in DD reaction. We have taken into account that the prduc- 
tion cross section of ?He is approximately twice smaller than the total deuterium 
burning cross section. At high temperatures, when a,Mj)(ov)3He pT Vp > 1, pro- 
duced ?He rapidly burns out, and its abundance is kept at such a level that the 
right-hand side of Eq. (8.54) is small, i.e., 


1 
V3H6e œ EAGUDD Up. (8.55) 
2 (OV) 3He D 
Burning of He terminates when 
QpMp(0v)3He pI Vp ~ 1. (8.56) 


The meaning of this relation is that the burning rate per ?He nucleus becomes of 
the order of the Hubble parameter, (0v)3He pnp ~ H (recall that a,vp = np/T®). 
We insert vp ~ 2-10~° into Eq. (8.56) and obtain an estimate for the temperature 


Tape ~ 35 keV, Tse 9 = 0.4. 


Note that at T ~ Ty. the abundance of deuterium vp is of the order of the final 
deuterium abundance (8.51b), so we can indeed make use of Eq. (8.51b) in (8.56). 


8Our emphasis in this section is on BBN determination of ng, so we keep this parameter free. We 
note, however, that irrespectively of BBN, this parameter is determined with good precision from 
CMB observations; see Fig. 8.2. 
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Problem 8.5. Show that at T >> Ty, the evolution rate of vse, calculated accord- 
ing to (8.55), is such that the left-hand side of Eq. (8.54) is small compared to each 
of the terms in the right-hand side. This justifies the result (8.55). 


At T = Tsye the abundance of ?He is still estimated as in Eq. (8.55). At T < Taye 
the right-hand side of Eq. (8.54) is dominated by the second term: the reaction 
D + D — °He + n still proceeds, albeit at low rate. The estimate for production of 
3He at T < Tay. gives, roughly speaking, the same result as in (8.55). In this way 
we obtain an estimate for the final abundance 

N3He (ov)pD 


1 
= — He, NE ep a 0,MeT 7 8.57 
V3He np 2(ov)eHe D VD 5) pl. pj (ov) pDYp; ( ) 


where the right-hand side is calculated at T = Tse. Numerically, 


N3He 
Np 


~ 1075, 


which is of the same order of magnitude as deuterium abundance. The latter prop- 
erty has to do with the fact that the burning rates (ov) are almost equal for 3He 
and deuterium at temperature T = 35 keV, when burning of both deuterium and 
3He terminates. 

Let us turn to tritium. Tritium nucleus is unstable, so its final abundance cannot 
be measured directly. However, tritium participates in production of other elements, 
and its abundance is important for BBN. Tritium burning in reaction 


T+D—‘4He+n. 


is treated in a similar way as for *He. The production rate of tritium is approxi- 
mately the same as that of *He, whereas the burning rate is 


3 = 
(ov) TD, n)4He = 10715 “ aa Tien, (8.58) 


Note that the temperature dependence of this rate is weak. Tritium burning rate is 
higher than for 3He, so tritium burns longer than ĉHe, and its final abundance is 
lower. We make use of the relation similar to (8.56) and find that tritium burning 
terminates at 


Tr ~12 keV, Tr 9 = 0.14. 
The formula analogous to (8.57) gives 


—~3-107" for ns = 6.05107". (8.59) 


Both ratios NnsHe/Np and nrT/np, are, roughly speaking, inversely proportional to 
Ns. More careful analysis gives nsye/np X lie see Fig. 8.5. 
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Fig. 8.5. Predictions of BBN theory for primordial abundances of He, D, ?He, “Li and the 
observational data at 2ø confidence level [1]: statistical errors (solid lines) and statistical and 
systematic errors together (dashed lines). Theoretical uncertainties are shown by thickness of 
the lines. Vertical strip “CMB” is the CMB result for ng. The parameter on horizontal axis is 


r % n "m : 
mo = Ne - 10°. On vertical axis are: Yp = —_“te He __ mass fraction of tHe (the same 
NH MH HNA He M46 


as X4ye in the text); np/ny, n3ay./ny and n7z;/ny, relative abundances of other elements. 
Subscript p in notations refers to primordial abundances. 


Let us make one point here. We have claimed that practically all neutrons end 
up in *He, and this is true. There is a non-trivial reason for that, however. Namely, 
burning of helium-3 and tritium occurs faster than their production from deuterium. 
Were this not the case, deuterium would burn out first, and reactions 7He + D —> 
tHe + p, T +D — tHe + n would terminate at the stage when neutrons are bound in 
helium-3 and tritium, rather than in “He. On the other hand, freeze-out abundances 
of ?He and T are sizeable, since their burning rates are not vastly higher than 


deuterium burning rate at T ~ Tyg. A certain diversity of light elements in cosmic 
medium after BBN is the result of rather accidental coincidences between low energy 
thermonuclear cross sections. 
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8.3.4. *Production and burning of the heaviest elements 
in primordial plasma 


As an example of reactions involving the heaviest elements of primordial plasma, let 
us consider production and burning of “Li in reactions T(a, y)’Li and ‘Li(p, a)*He, 
respectively. 

Production reaction is reasonably well described by the formula (8.38), in which 


co X M7? -a (the factor of æ is due to photon emission). Numerically, 


3 seas 
(OV) T(a,7)7Li ~ gm i ja ee KA (8.60) 


The rate of tritium burning via this channel is 
(ov) ral Na S 1.5- 1074s7t, To = 0.75. 


This rate is small as compared to the Hubble parameter. The burning reaction is 
also described by formula (8.38), and we obtain 


3 
15 Cm a le 


(0V})TLi(p,a)tHe ~ 107 5 


Here the parameter go is determined by strong interactions only, so the rate is much 
higher than the production rate (8.60). Numerically, the burning rate of lithium-7 is 


(ov) 71i(p,a)4He ‘Np X 0.7 s71, at To =0.75, ng = 6.05107, 


which is higher than the Hubble parameter. Hence, burning of “Li terminates rather 
late, when the number density of protons gets diluted substantially due to the 
cosmological expansion. 

Temperature at which burning of “Li terminates, is determined from the relation 
(cf. (8.56); we recall that v, = 1) 


ApM>p) (OV) rLi(p,a)4He ~ 1. 
We obtain numerically 
Tri ~ 18 keV, Trgi g = 0.2. 


The final abundance of "Li produced in the way we consider is estimated as (cf. 
(8.57)) 

(OU) T(a,7)7Li 
(0V})TLi(p,a)tHe 
where Va = Na/Nnp = (0.25/4)/0.75 ~ 0.08 is the abundance of “He, and we use 
(8.59) for tritium abundance.? 


-VpVo ~ 6: 107", (8.61) 


Vit = 


9Since T71; > Tr, using (8.59) for tritium abundance is not quite legitimate. Instead, one should 
make use of the relation similar to (8.55). However, this does not grossly modify the result, since 
T7; and Tr are close to each other. 
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The production of Be is also important. This isotope is unstable, so its 
abundance is not directly measurable. Beryllium-7 transforms into lithium-7 in 
cosmic plasma either via electron capture Be(e7,ve) Li, or in the reaction 
"Be(n,p)’Li. Thus, lithium-7 is produced either directly, in tritium-a fusion, or 
through beryllium-7. The existence of the two production mechanisms gives rise to 
non-monotonic dependence of the primordial lithium-7 abundance on 7,;. Formula 
(8.61) is valid for low 7; where the abundance of “Li decreases as Ns increases, 
while realistic value of 7, corresponds to the range where the abundance increases 
with ns. The latter behavior is due to processes involving “Be. 


Let us describe in some details the calculation of burning rates (av) (see Ref. [115] for fur- 
ther details). Averaging over energies proceeds with the Maxwell—Boltzmann distribution, 


_ 2 e- E/T 
=F faa [ow dE, (8.62) 


where v is the relative velocity of colliding nuclei, M is reduced mass, E is kinetic energy 
in the center-of-mass frame, o(/) is the cross section of the reaction of interest; processes 
2 — 2 are by far dominant. 

If one of the particles is a neutron, Coulomb barrier is absent. Assuming the s-wave 
reaction, i.e., @ ~ vT}, the neutron reaction cross section far from resonance energies is 
natural to write as follows, 


The function R(E) depends on E weakly in the interesting energy range Æ < 1 MeV, so 
that it can be approximated by a few terms of the Taylor series in velocity, i.e., VE, 


R(E) = 


n=0 


R& (0) pn? 


n! : 


where R™ (0) are determined by fitting experimental data at low energies. Then the 
integral (8.62) can be evaluated analytically, and the neutron burning rate in the channel 
of interest is given by 


If the reaction has resonance character (like the reaction "Be(n, p)” Li which has resonances 
at Er ~ 0.32 MeV and Er ~ 2.7 MeV), then in the case of isolated resonance the cross 
section in the resonance region has the Breit—Wigner form, 


m (2J+1)(14+ 6:;) Tin(E)Tout(E) 


(2) = OME QJ: +1) QJ; + 1) (Œ Er) + (Pr/ 
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where Er and Ip are energy and width of the resonance, Ji, J; and J are total angular 
momenta of the initial nuclei and the resonance, and Tin(£) and Fout( E) are partial decay 
widths of the resonance state into initial and final states, respectively. The functions Din (E) 
and T'oue(#) are also determined from experiment. In the narrow resonance case, [ < Ep, 
the integral (8.62) is well approximated by 


3/2 .) T 
(ov)(T) = 2m (2J + 1) (1 + bij) Taal ous i e ERIT, 
MT) QL+D(2J;+1) Tr 


If both incident nuclei are charged, the important phenomenon is the exponential 
Coulomb suppression. As we have seen, the exponent is determined by the Sommerfeld 
parameter 


(8.63) 


where Eg is known as the Gamow energy. The cross section is conveniently represented as 


s(n = EE ea, 
Assuming that S(E) is a polynomial in EF, the integral (8.62) can be evaluated by the 
saddle point method, which gives 


for _3(2g)'/3 
(ov)(T) = ay ee 8(zF)  . So(Eo), (8.64) 


ner p Ver 2i Ty” 
ee E; is (2E, i 


are the saddle point value and width, while So(Eo) is a polynomial in (a0/Eo)? œ 
(T/E,)‘/°. The latter function can be determined from experiment. Note that the saddle 
point parameter oo/Eo = (T/E,)'/° is fairly large, so obtaining good accuracy requires 
employing high order polynomials. In practice one often makes use of other semi-analytical 
approximations to compute the rates (ov). As an example, reactions T(D,n)*He and 
3He(D, p)*He proceed through resonances. Even though these resonances are wide, the 
resonance contributions can be approximated by (8.63). The latter approximation is quite 
good numerically, while the resonance contributions turn out to be dominant in the tem- 
perature range of interest. We have used this approximation in the text; this is why the 
temperature dependence of the rates given in (8.53), (8.58) is different from the dependence 
that would follow from (8.64). 

Thus, the calculation of the rates (av) involves, in an important way, experimental 
data on nuclear reactions. In some cases, including the reactions p(n, y)D, D(p,7)*He, the 
data are scarce, which leads to uncertainties in predictions of element abundances. The 
first reaction, p(n, y)D, is, however, well understood theoretically, and this knowledge is 
often used in real calculations. 

We mention in the end that effects due to excitations of nuclei and Debye screening of 
nuclei by free electrons are not very relevant for BBN. 


where 


8.4. Comparison of Theory with Observations 205 


8.4. Comparison of Theory with Observations 


BBN theory is well-developed. Numerical analysis gives precise predictions for light 
element abundances in primordial plasma. These predictions are tested by mea- 
suring the chemical composition of matter in those places in the present Universe 
where the composition is thought to be primordial despite evolution. 

The evolution effects are very strong in most places in the Universe. Primor- 
dial matter is processed in stellar thermonuclear reactions occurring in the recent 
Universe, at z ~ 0—10. Some of the primordial nuclei transform into heavier ele- 
ments, while others are destroyed by hard y-quanta emitted in the star formation 
processes. These y-quanta destroy also heavier elements already produced, so that 
more light elements appear. All these processes lead to considerable changes in the 
light element abundances as compared to the primordial ones. 

In some regions of the Universe, however, local abundances of some elements are 
thought to remain unchanged. These are regions with low star formation rate: very 
distant (high redshift) regions where star formation has not happened yet and/or 
low-metallicity regions. The latter can be found by analyzing absorption spectra. 

Deuterium is special among light nuclei: it has very small binding energy and 
hence is not produced in stellar nucleosynthesis; it predominantly gets destroyed. No 
substantial sources of deuterium are known, so any measurement of local deuterium 
abundance sets a lower bound on its primordial abundance. Recently, deuterium 
abundance has been determined by spectroscopy of high-z, low-metallicity clouds 
which absorb light of distant quasars. 

Primordial helium-4 abundance is measured by spectroscopy of low-metallicity 
clouds of ionized hydrogen in dwarf galaxies. Production of helium-4 in stars is 
accompanied by production of heavier elements, metals in astrophysics terminol- 
ogy, so absence of the latter in clouds suggests that helium-4 is mostly of BBN 
origin there. 

Lithium-7 abundance is determined by spectroscopy of low-metallicity old stars 
in globular clusters of our Galaxy. 

No regions in the Universe have been found so far, where He abundance could 
be measured and where ?He would be mostly of primordial origin. Primordial abun- 
dance of this element is not as sensitive to the parameter 7, as that of deuterium, 
and measurements of ?He tell more about evolution of stars and Galaxy than 
about BBN. 

Spectroscopic measurements of relative local element abundances are quite pre- 
cise by themselves. The major uncertainties are systematic and, roughly speaking, 
have to do with limited confidence on the primordial origin of these abundances. 
The predictions of BBN theory together with observational data [1] are shown in 
Fig. 8.5. These results are in overall agreement with each other and with the value 
Np = 6.05 - 1071? obtained from CMB data, although systematic uncertainties are 
still pretty high. 
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Still, the data is precise enough to make an impact on particle physics. First, as 
we have already discussed, BBN imposes limits on the density of new relativistic 
particles at T ~ 1 MeV; see (8.17). 

Second, there should be no decays or annihilations of new heavy particles with 
emission of numerous hard photons at BBN epoch and somewhat later. If the main 
decay channel of a heavy particle X is 


X > YY, 


where Y is a particle with my « mx, then the energy of produced photon is 
E, + mx/2. It is the emission of these photons that is dangerous for BBN. There 
are two effects. The first one is the destruction of light elements by the hard photons. 
Particularly sensitive to this destruction is deuterium. The second effect has to do 
with the disintegration of helium-4. Even though this nucleus is tightly bound, hard 
photons may well break it up into lighter nuclei. Since helium-4 is very abundant, 
this would lead to overproduction of lighter elements, notably, He. The corre- 
sponding bounds [116] in the space of parameters Tx and Çx, where Tx is X-particle 
lifetime and 


are shown in Fig. 8.6. 
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Fig. 8.6. Model-independent BBN bounds on models with long-lived particles decaying into high 
energy photons [116]. Models with parameters above solid lines are ruled out by measurements of 
light element abundances. Upper lines corresponds to the most conservative bounds. 
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Examples of models with X- and Y-particles include some supersymmetric 
extensions of the Standard Model, with the decaying X-particle being, e.g., neu- 
tralino and stable Y-particle being gravitino (see Sec. 9.7). Similar bounds [117] 
exist on models with long-lived particles decaying into hadrons. 

Yet another source of bounds on extensions of the Standard Model is the observa- 
tion that there should be (almost) no entropy production at BBN epoch. Otherwise 
the nucleosynthesis temperature would be different, and the abundance of helium-4 
would be inconsistent with observations. 


Chapter 9 


Dark Matter 


As we have discussed already, large contribution to the total energy density in the 
present Universe (about 25%) comes from dark matter which consists, most likely, 
of new massive particles absent in the Standard Model of particle physics. These 
particles must be non-relativistic and should have practically no interactions with 
photons.! The latter requirement comes from the observations showing that dark 
matter halos are much larger than baryon parts of galaxies, meaning that dark 
matter experiences very little, if any, photon cooling (see Fig. 1.7 in Chapter 1). 

Clearly, dark matter particles must be very long lived. The minimal requirement 
is that their lifetime exceeds the age of the Universe. The point is that the dark 
matter density is known both at the present epoch (from mass distributions in 
clusters of galaxies and other data) and at early epoch of intense structure formation 
and even earlier, at recombination. By comparing these data one concludes that the 
variation of dark matter density in comoving volume, if any, must be small. If dark 
matter particles decay into Standard Model particles, then the limit on lifetime is 
typically much stronger: null (for now?) results of searches for photons, positrons, 
antiprotons, neutrinos produced in decays of dark matter particles imply that the 
decay rates through these channels exceed the age of the Universe by many orders of 
magnitude. In such a situation a natural assumption is that the dark matter particles 
are absolutely stable. In elementary particle physics, stability is often guaranteed 
by the existence of a conserved quantum number (or several numbers). Therefore, 
an explanation of the dark matter phenomenon requires introducing a new particle 
and, likely (though not necessarily), a new conserved quantum number. 


lAn interesting possibility is that dark matter particles experience fairly strong interactions 
between themselves. Their long-ranged interactions would lead, among other things, to formation of 
spherical halos, while observationally most halos of clusters of galaxies are ellipsoidal. This contra- 
diction is avoided provided that the mass and cross section obey [161] ¢/Mx « 107?4 cm?/GeV. 
At the same time, we mention that considerable elastic cross section of dark matter particles would 
be helpful in explaining the distribution of dark matter in galactic centers: numerical simulations 
of non-interacting dark matter predict cuspy profiles, with strong increase of dark matter density 
towards the center. These cusps apparently are not observed. 


209 


210 Dark Matter 


In this Chapter we consider several mechanisms of the dark matter generation 
in the Universe and some extensions of the Standard Model which contain dark 
matter particle candidates. Let us make an important comment right away: none 
of these mechanisms explains approximate (valid within a factor of 5) equality 


Ps,0 ~ PDM,0; (9.1) 


where pz, and ppm are energy (mass) densities of baryons and dark matter in the 
present Universe. This approximate equality was also valid at earlier stages, since 
the times dark matter and baryon asymmetry were generated. Several suggestions 
have been made in literature on how mechanisms of baryon asymmetry and dark 
matter generation may be related and lead to (9.1), but none of them appears 
compelling. The approximate equality (9.1) may be accidental indeed. 


9.1. Cold, Hot and Warm Dark Matter 


Let us make, for the sake of concreteness, a rather natural assumption that dark 
matter particles X were in kinetic equilibrium with conventional matter in the early 
Universe.” At some moment of time these particles get out of equilibrium and since 
then they propagate freely. If the corresponding decoupling temperature Ty is much 
smaller than the mass of the dark matter particle Mx, these particles decouple being 
non-relativistic. In this case dark matter is cold. In the opposite case, Ty Z Mx, 
there are two possibilities, My {£ 1eV and Mx = 1eV. The former corresponds to 
hot dark matter: its particles remain relativistic at matter-radiation equality (recall 
that equality occurs at Teg ~ leV, see Sec. 4.4); this is the case, e.g., for neutrino, 
as we have seen in Chapter 7. In the latter case dark matter is called warm: it is 
non-relativistic by equality epoch. We see in the accompanying book that density 
perturbations grow differently at radiation domination and matter domination, and 
that this growth strongly depends on whether dark matter is relativistic or not at 
equality. This is the reason for distinguishing hot and warm dark matter. 

One effect specific to hot and warm dark matter is as follows. Let dark matter 
have primordial density perturbations and its particles be free and relativistic in 
the temperature interval Ty = T = Mx. At that time dark matter particles escape 
potential wells and fill in underdense regions of sizes up to current horizon size. Due 
to this free streaming, dark matter density perturbations of these sizes get washed 
out. Hence, hot and warm dark matter have low amplitudes of density perturbations 
at relatively short scales. 

Free streaming terminates at T ~ Mx. The horizon size at that time, stretched 
by a factor (1+ z) = T/To, is the present maximum size of suppressed density per- 
turbations. In the warm dark matter case the equality T ~ Mx occurs at radiation 


?This assumption, in fact, might not hold, some examples are given in Sec. 9.4.2-9.8. 
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domination, so the horizon size at that time is 
Mh | Mh 
x . 
T? M2 
The corresponding present size is 
T M5, 


ly ~ . 
"To ToMx 
Thus, models with warm dark matter predict the suppression of density perturba- 
tions of the present size Io < lx,9. For My ~ 1keV we take g,(T ~ Mx) = 3.36 


(see (4.22)), so that M5, = Mpi/(1.66\/G.) = 4- 1018 GeV. Then Eq. (9.2) gives 


lxo ~ (9.2) 


Mx ~ 1keV: lxo ~ 3:10” cm = 0.1 Mpc, 
while 
Mx ~1eV: lx o ~ 100 Mpc. (9.3) 


Hence, models with hot dark matter predict the suppression of density perturbations 
of present sizes up to 100 Mpc. We refine these estimates in the accompanying book. 

In hot dark matter models, largest structures — superclusters of galaxies — get 
formed first, and then they fragment into smaller structures, clusters of galaxies. 
Galaxies are the latest objects in these models. This sequence of events is in strong 
disagreement with observations. 

Probably the best option is cold dark matter. Hot dark matter particles (e.g., 
neutrinos) should make a small contribution into the total dark matter density. 

Spatial size of order 0.1 Mpc is typical for perturbations that developed into 
small structures like dwarf galaxies.” The studies of structures of this and somewhat 
larger sizes gives the lower bound on the mass of dark matter particle, 


Mx > 1keV. (9.4) 


Similar bound comes from quite different argument based on the phase space density 
of dark matter particles in dwarf galaxies. We discuss this bound in some details in 
the accompanying book, and give numerical result in (7.34). 

Thus, warm dark matter is still a viable possibility. We emphasize that the 
above bounds on the mass of dark matter particle apply to dark matter which 
was in kinetic equilibrium with the usual matter at some early epoch. For non- 
thermal momentum distribution the estimate (9.4) gets modified by a factor of order 
(\p|)/(|p|)°?, where (|p|) and (|p])®? are actual average momentum and thermal 
one, respectively. It is worth noting that the comparison of CMB data and data 
on structures has the following general outcome: if dark matter particles were in 
kinetic equilibrium with the baryon—-electron—photon plasma, they decoupled at 


3Mass density in galaxies exceeds the average mass density by a factor 105—108. This means that 
matter in a galaxy clumped from a region whose size exceeds the size of the galaxy itself by a 
factor of 50-100. This leads to the estimate given in the text. 
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temperature Ta = 1 keV irrespectively of their mass. (We discuss this point in the 
accompanying book.) 


There exist model independent bounds on the masses of dark matter particles, which 
apply equally well to dark matter that had never been in kinetic equilibrium with the 
usual matter. Of course, these bounds are very weak. They come from the fact that dark 
matter particles must be confined in galaxies. For bosons the latter requirement implies 
that their de Broglie wavelength à = 27/(Mxvx) must be smaller than the dwarf galaxy 
size, 1kpc. Making use of the fact that velocities in galaxies are vx ~ 1074, we find 

Mx > 4-107” ev. 
The bound is much stronger for fermions, due to Pauli principle. Assuming Maxwell dis- 
tribution of dark matter fermions in galactic halo (this in fact is a reasonable assumption), 
we find for their phase space density 


re - | 


Mx (v 2r Mxvx)3 


where px(x)/Mx and vy are the number density and velocity dispersion of dark matter 
particles in a halo. The maximum of the phase space density as function of momentum 
occurs at p = 0 where f(p,x) is given by 


J™™ (p, x) = Px (x) il 


ME Om) 


This maximum value cannot exceed the maximum value allowed by the Pauli principle 
(see (5.4)), 


_ 9x 
Tf one 


Taking gx = 2,vx ~ 107° and p(x) ~ 0.5GeV/cm? (typical mass density in a halo) we 
obtain the bound 


Mx > 25eV. 


Stronger bound is obtained from the existence of dwarf galaxies. There, the mass density 
reaches ~ 15 GeV /cm?, which gives the bound 


Mx 2 750eV. 
We discuss this and similar bounds in the accompanying book. 
We note that there is also rather formal upper bound on the mass of dark matter 
particles, of order of a thousand solar masses (see, e.g., [120, 121]), 


Mx $ 10° Mo ~ 10% GeV. 


This bound comes from the stability of stellar clusters in the Galaxy, which would be 
destroyed by gravitational tidal forces induced by dark matter “particles” passing nearby. 
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9.2. Freeze-Out of Heavy Relic 


Let us turn to one of the most attractive scenarios of the dark matter generation. 
We will discuss concrete examples in the following Sections, and now we calculate 
the residual abundance of heavy relic particles in general form. 

To this end, let us consider the following situation. Let there exist stable heavy 
particles X and their antiparticles X which are in thermal (including chemical) equi- 
librium with cosmic plasma at sufficiently high temperatures. Let their interactions 
with the rest of the plasma be strong enough, so that they remain in equilibrium at 
temperatures somewhat below Mx. This assumption should, of course, be justified 
by the calculation of freeze-out temperature. Stability of the X-particle suggests 
that the X-particle can be created together with its antiparticle only. Let us ignore 
possible complications and assume that this is indeed the case. Finally, let us assume 
that there is no asymmetry between X-particles and their antiparticles, i.e., 


nx —Nx =0. (9.5) 


This assumption is very important: the results of this Section are not valid for 
the Universe asymmetric with respect to X-particles. Our task is to calculate the 
present abundance Nx of X- and X-particles.* 

Another possibility leading to essentially the same results is that X is a truly 
neutral particle, i.e., X coincides with X, while X-particles are produced and anni- 
hilate in pairs. In that case the condition (9.5) is satisfied automatically. We meet 
this situation in Sec. 9.6.1. In what follows we consider for definiteness theories with 
X and X. 

The number densities in thermal equilibrium at temperature T < Mx are 


3/2 
MxT 
n% =n = ~ eval, 9.6 
xX gx On ( ) 
Here we made use of the relation (9.5) to set the chemical potential of X-particles 
equal to zero. Under the above assumptions, the number of X-particles in comoving 
volume changes due to annihilation and pair creation only, 


XX — light particles. 


When the rate of annihilation is higher than the cosmological expansion rate, the 
abundance of X is given by the equilibrium formula (9.6). At some moment of 
time the pair production terminates: there is not enough light particles of energies 
of order Mx. The X-X annihilation terminates little later, when the the number 
density of X-particles becomes too small. At that time the abundance of X-particles 


4We note that the calculation is much simpler, if there is asymmetry between X and X in pri- 
mordial plasma. Assuming that X—X annihilation is fast, the cosmic medium at low temperatures 
contains only particles (assuming positive asymmetry). Their abundance is determined by the 
asymmetry nx = (nx — nx)/s. The present relative mass density is thus Qx = Mxsonx /pc. It 
does not depend on the annihilation cross-section in a wide range of parameters. 
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freezes out; after that, the number of particles is constant in comoving volume. We 
note in passing that kinetic equilibrium typically persists long after the freeze-out. 
The evolution of the number density of X-particles is governed by the Boltzmann 
equation (cf. Sec. 5.4), 
dnx ann 2 eq2 
a tT 8Hnx = —(o -vy (nk ng’). (9.7) 
Here (o®””™ . v) is the product of the annihilation cross-section and relative velocity 
v of X-particles, averaged with equilibrium distribution functions (asuming kinetic 
equilibrium) and summed over all annihilation channels. 
A simple way to obtain Eq. (9.7) is as follows. The annihilation probability of a 
given X-particle per unit time in medium with the antiparticle density ng = nx is 


Tann = (0 -v) + nx. (9.8) 
This gives for the decrease of X-particle number in comoving volume a® 
d 3 
H = Tann xe” =—(a%™ . v) nda’. (9.9) 


In thermal equilibrium, i.e., for nx = n%, this decrease is compensated by pair 
creation, so the increase of the X-particle number due to pair creation is given by 


jeo 


7 = (09. y) nA? a’, (9.10) 


| creation 
The total change in the X-particle number in comoving volume is given by the 
sum of (9.9) and (9.10), which leads precisely to Eq. (9.7). We emphasize that the 
contribution (9.10) is independent of the actual abundance of X-particles; the only 
condition of its validity is that other particles are in thermal equilibrium. 

The annihilation of non-relativistic particles often occurs in s-wave. In that case 
the velocity dependence of the non-relativistic annihilation cross-section is given by 
(Bethe’s law) 

Oann = a (9.11) 
where go is a constant which does not depend on velocity and which is determined 
by interactions responsible for annihilation. If the annihilation occurs in p-wave 
instead, then the cross-section has the form a°”” = gv. Contributions of higher 
angular momenta are additionally suppressed by powers of v. s-wave contribution 
is leading for non-relativistic particles, unless it is very small for some reason. 


Let us briefly remind the reader of the way the law (9.11) emerges. (Detailed analysis can 
be found, e.g., in the book [111].) It applies not only to the annihilation process but also 
to any inelastic s-wave reaction. The main property which is assumed to hold is that the 
relevant interaction is not long ranged, i.e., it occurs inside a region of a certain size a. As 
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usual, let us consider the flux of non-relativistic particles X incident on particle X at rest. 
Then the reaction probability per unit time is 


P = Ca*|h(a)|’, 


where (a) is the wave function of X-particles in the collision region near X-particle, and 
the constant C is determined by the details of the interaction. It is important here that 
particles annihilate in s-wave state, so that there is no centrifugal barrier. To obtain the 
cross-section, one divides the probability P by the flux of X-particles 


j= UVY" - vv). 


Away from the interaction region, the wave function is the plane wave of momentum p 
describing motion along the third axis, 


poe”, 


then the flux is equal to velocity v. Since the interaction is short ranged, the modulus 
of the wave function in the reaction region is |(a)| = const with velocity-independent 
constant of order 1. The result for the cross-section is 


P Ca? 
= + = const - —, 
lil v 


which is precisely (9.11). Note that the annihilation of heavy particles occurs with large 
energy release, AE ~ Mx, so the size of the interaction region is indeed small 
1 


eo 


Mx” 

It is worth noting that in the case of electrically charged particles of opposite charges 
(and in other cases of Coulomb-like attraction), the wave function in the reaction region 
may be considerably different from the asymptotic one (see the book [111] for details). 
This occurs at kinetic energies smaller than the binding energy of X-X atom, E < a? Mx. 
In this case the annihilation cross-section scales with velocity as o x 1/v?. This effect will 
be irrelevant in examples below. 

Additional factor v? in p-wave annihilation cross section emerges because of the cen- 
trifugal barrier. In the case of annihilation in the state of angular momentum / similar 
factor equals v”. 


It is sufficient for our purposes to consider s- and p-wave contributions to anni- 
hilation cross-section, so we write 


T 
(a9 v) = oo +01 (v7) = oo + 201 (0%) = oo + 670 (9.12) 
x 


where we use the fact that the average relative velocity squared equals? (v7) = 
2(v%), and that the average kinetic energy of non-relativistic particles is related to 


2 


5 Indeed, one has v? = (vı v2)" = v1 2v1 : V2 + v2°, and the cross term averages to zero. 
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temperature as follows: 


By a i 
(m) = w 3 (9.13) 

Let us come back to the Boltzmann equation (9.7). It follows from (9.6) that at 
high temperatures both terms in the right-hand side of Eq. (9.7) are large, chemical 
equilibrium is maintained and nx equals to n¥. On the other hand, at low temper- 
atures the term with n% is small, and X-particles get out of chemical equilibrium. 


Chemical equilibrium breaks down at temperature Tp, such that 
dn 

dt 

We accurately justify the relation (9.14) below, and here we point out that at 


T < Mx the most rapidly evolving function is the exponential factor in (9.6), and 
therefore 


m (oun. (9.14) 


dng _ Mx? «ą Mx 


d 72 XOT 


H(T)n, (9.15) 


where we make use of (3.34). Hence, the temperature at which chemical equilibrium 
breaks down obeys 


y, HT) ~ (own (9.16) 
f 
or 

M 

A = log [ox (27A, 66/7;) (Mx T)/2@Mpi(o%"" 0) | , (9.17) 


where (0°""v) = 09 +601T;/Mx. This equation has the form (6.18). The parameter 
(av) can be very roughly estimated as (07""v) ~ Mx”. This estimate is sufficient 
to understand that the logarithmic approximation works for Mx < Mý. Then with 
logarithmic accuracy we have 


ees Me )| W (9.18) 


EN 
We see that the temperature Tẹ of the exit from chemical equilibrium weakly (loga- 


rithmically) depends on the annihilation cross-section. This temperature is smaller 
than Mx by factor 


Ty = Mye ox ( 


Ls ow (Se) S , (9.19) 


This justifies our assumption that X-particles freeze out being non-relativistic. 
After creation of X-particles terminates (the term with n% in the Boltzmann 
equation (9.7) becomes small), the annihilation of X-particles continues. To study 
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this effect and develop an approach to the exact solution, we rewrite Eq. (9.7) in 
terms of X-entropy ratio. To this end, we introduce the variables 


eq 
_ nx eq _ x 
S S 


where s = (277/45)g,T7° is the entropy density at temperature T. Using the entropy 
conservation in comoving volume, Eq. (5.30), we write 


This leads to the following form of equation for Ax: 


dA 
a = — (ov) sgoe (A¥ —_ AY =) (9.20) 


It is now convenient to change the variable t to 


T 


x reat 
Mx 


This variable shows to what extent X-particles are non-relativistic (see (9.13)). In 
view of Eq. (3.34), Eq. (9.20) takes the form 


dAx _ (ay) 2 éq2 
a = He AAR] 


or, using explicit expressions for s = s(T) and H = H (T), 


dA A/T Gx 
— = (0o). E ‘My - Mri- (A, — A$1?). (9.21) 


Making use of (9.12), we get finally 


dA A/T Ja 
—~ = (oo + 601:2): WE MeMa (2 — 02), (9.22) 


This equation is suitable for numerical integration. We, however, pursue analytic 
approach and solve this equation at T < Ty approximately, neglecting the term with 
A% ? in the right-hand side of Eq. (9.22). This corresponds to switching off the pair 
creation of X-particles at T < Ty. At T = Ty we have nx ~ n$, where n% (Tp) is 
found from® (9.16): 


F MxT 
o E 
(omno) Mp, 


6We emphasize that inserting the temperature (9.18) into the expression (9.6) for number density 
would lead to large error, since temperature enters (9.6) exponentially, while the formula (9.18) 
has logarithmic accuracy only. 
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This gives the initial condition for simplified equation (9.22): 


ea (T L2 
Ax (af) © rab = ve —— (9.23) 
s(T$) STga Mx Mp; (a2 v) 


where L is large logarithm entering (9.19). 
Equation (9.22) without the term with AY is straightforward to integrate: 


AX (2) = [oo(a¢ — x) + 301 (a —«*)]- a - Mx - Mp, + AX" (zp). 


Because of the presence of square of large logarithm in (9.23), the last term in the 
right-hand side is small, so we obtain at low temperature, x — 0, 


3v5 1 


Ae Ae 9.24 
x(0) TJs Mx - Mpi: &f (0) eff an 
where 
(ov) eff = To + 3012p. 


This is our final result. Recall that the parameter x» = Ty /Mx is determined by 
Eq. (9.17) with logarithmic accuracy, 


T= L4 


A few remarks are in order. First, we observe that Ax(0)/Ax(xf) ~ L~1. This 
means that the annihilation of X-particles after exiting from thermal equilibrium is 
important: it reduces their abundance by a factor of L~!. Second, the result (9.24) 
could be obtained, modulo a factor of order 1, in a very simple way. Indeed, the 
annihilation terminates when the lifetime of X-particle with respect to annihilation 
becomes of the order of the Hubble time,” 


ann 


(0° vu)nx ~ H. (9.25) 
If annihilation is effective, the number density of X-particles freezes out at tem- 
perature close ot Ty, which gives (9.24). Finally, the result (9.24) is valid with 
logarithmic accuracy only, since it is obtained by neglecting n% in Eq. (9.22). To 
obtain exact value of Ax(T = 0), one has to solve Eq. (9.22) exactly. We do not 
need this solution in what follows. 


Problem 9.1. Show that using the approximation (9.25) gives exactly (9.24) in the 
case of s-wave annihilation. This coincidence is absent for p-wave annihilation. 


TThis condition is weaker than the condition for exit from chemical equilibrium: the rate of evo- 
lution of logn is higher than the Hubble rate at T ~ Tp; see (9.15). 
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After freeze-out, the number density nx changes only because of the 
cosmological expansion, and Ax does not change at all. So, the present number 
density of X-particles is 


nx (to) = soAx (0), 


where so = 2.9- 10° cm? is the effective entropy density at present; see (5.34). 
Then our result (9.24) gives for the present mass density of X and X 
Mxnx (to) 7.689 


= —__ ao (9.26) 


Qy =2 = f 
Pe Pct f (anny) eff Mri Gx (ty) 


and, with logarithmic accuracy, 


Gev~? ) 1 i ae 
roje) Santen On) ae 


Clearly, the most relevant parameter is the annihilation cross-section at temper- 
ature Ty. The dependence of Qx on Mx is logarithmic only, while the effective 
number of degrees of freedom gą does not change much during most of the history 
of the Universe. 

Our main emphasis in this Section is freeze-out of X-particles which has to do 
with breaking of chemical equilibrium. It is of interest, however, to figure out at 


OQxh? =1.8- 10-"( ) . (9.27) 


what temperature kinetic equilibrium breaks down. Indeed, one of our assumptions 
was that the distribution functions have equilibrium form. Kinetic equilibrium holds 
due to scattering of X-particles off light particles, so the time between collisions of 
a given X-particle is idependent of the number density of X-particles. Therefore, it 
is shorter than the lifetime with respect to annihilation 7,1; see (9.8). This means 
that kinetic equilibrium holds much longer than chemical equilibrium, i.e., it breaks 
down at temperature Thin < Ty. As an example, if X-particles participate in weak 
interacions, then the cross-section of their elastic scattering off, say, electrons at 
E « 100 GeV is estimated on dimensional grounds as 


Ge ~ GLE’. (9.28) 


The mean free time of X-particles is roughly 


Tel ~ (Ne Oel’ v)', 
where ne is electron number density and v is the relative velocity of X-particles and 
electrons, v ~ 1 at T >> 1 MeV. To estimate the temperature at which the kinetic 
equilibrium breaks down, we equate Te; to the Hubble time H~'(T’) and obtain (this 
estimate is similar to that made in Sec. 7.1) Trin ~ 1 MeV. This is of course a crude 
estimate but it shows that the kinetic equilibrium of X-particles breaks down fairly 
late. 
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Problem 9.2. Refine the estimate of Trin in the case when the cross-section of 
X-prticles off electrons is given by (9.28). Hint: make use of arguments given in the 
beginning of Sec. 6.3. 


Let us show that the exit from chemical equilibrium indeed occurs at temperature given 
by (9.16). To this end, instead of variable x = T/Mx we use the variable y = Mx /T and 
write the Boltzmann equation (9.20) as follows: 


= —— (Ov) -kx Mx Mpi: (AX -A pr 


where kK» = 2r? gx /45. Our purpose is to find temperature at which the relative abundance 
Ax starts to deviate from the equilibrium abundance AY. We write 


Ax = RAUL), 


The exit from equilibrium happens when 6 ceases to be small. Before that, the Boltzmann 
equation can be linearized, 
dé (2 (ov) Mx kx Mp, 


—-+ j? 


AX -1)6=1 2 


where we use the fact that due to strong dependence of the the exponential factor on time 
one has dAS!/dy = —A&, cf. (9.15). For relatively small y (high temperatures), the first 
term in parenthesis in (9.29) is large, therefore 


2 


y e =i 
ly) = ————_ [AZ . 9.30 
W) = seamen APU (9.30) 
(Note that dô/dy ~ ô, and the first term in the left-hand side of Eq. (9.29) is small.) We 
see that d(y) ceases to be small when y œ yy, where 


y? = (ov) Mx kx Mp AY (ys). 


This determines the temperature in question. In terms of physical quantities the latter 
formula gives 


ann M 
(0° v)neq(Ts) = *H(T)), 
f 


which yields (9.16). 


Problem 9.3. By explicitly solving Eq. (9.29) check the relation (9.30). Show that the 
initial value 6(yi) is rapidly washed out, provided that yi > yr: for any initial value of nx 
the number density of X -particles rapidly become thermal. 


Let us now give the general discussion of the Boltzmann equation which is widely used 
in cosmology. 

In general, the system of Boltzmann equations describes the balance of interacting 
particles, and Eq. (9.7) is a particular example, applicable to particles of a certain type 
which are pair created and annihilate. More accurate treatment of this situation is as 
follows. Let us consider for diversity the case of intrinsically neutral particles which can 
be created and annihilate in pairs (this possibility was mentioned in the beginning of 
this Section), and study these processes in Minkowski space. Let p1, p2 be 3-momenta of 
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incoming particles. The number of particles in the interval of momenta (p, p + dp) in the 
volume element d?x is 


dN = n(t, x) F(t, p)d°xd*p, 


where n(t, x) is the number density, and the distribution function F(t, p) is normalized at 
each moment of time as 


[Free =1. (9.31) 

Let us consider a particle of momentum p1. In unit time it annihilates with 
o-u-n(t,x)F(t, p2)d*pa (9.32) 
particles whose momenta are in the interval (p2, p2 +dp2). Here v is the relative velocity of 
the two annihilating particles and o = o(p1, p2) is the annihilation cross-section. This gives 


the annihilation rate in volume d?x for particles of momenta in the intervals (p1, p1+dp1) 
and (p2, P2 + dp2), 


1 
34N (p1: x) 0- v: n(t,x)F (t, p2)d°p2 
1 
= zr% x)F (t, pı)d’xd*pı -o -v - n(t,x) F(t, p2)d’pe, (9.33) 
where the factor 1/2 accounts for identical particles. (Otherwise the contribution of one 
and the same state (pı, p2) and (p2, pı) would be double-counted.) 
Since each annihilation reduces the number of particles by 2, the rate of decrease of 
particles due to annihilation equals twice the expression (9.33), 
n? (t, x)F (t, pı)d’xd*pı - o- v - F(t, p2)d’po. (9.34) 
There is also the inverse process of pair creation in collisions of other particles. The latter 
are in thermal equilibrium, and repeating the argument before Eq. (9.10), we write for 
pair creation rate 


n°"? (t,x) F(t, pi)d?xd*pi -o - v - F(t, p2)d?p2, 


where n“?, F®4 are equilibrium quantities. Finally, we integrate over momenta p2 to obtain 
the equation for the number density in phase space volume d*pd?x, 


[Petare] apx 


: -{ [EDF p) 
— n° (t,x) F(t, pi) F" (t, p2)] v- 0 dpa }d'pid’x. (9.35) 


The right-hand side here is the collision integral. 
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This general equation is simplified in the case of interest, when the number density is 
homogeneous in space, 


n(t,x) =n(t), n tx) =n i 


while the distribution over momenta corresponds to kinetic equilibrium and is time- 
independent in Minkowski space, 


F(t, p) = F“ (p). 
Then one integrates Eq. (9.35) over momentum pı and obtains 


an(t) 
ðt 


= —(ov)(n? — n°”). (9.36) 
Here we introduced the notation 
(ov) = [pid pak" (p1) F“ p2) eW (9.37) 


which is the thermal average of the relative velocity and annihilation cross-section (recall 
the normalization (9.31)). Note that the right-hand side of (9.36) is such that the system 
relaxes to thermal equilibrium for any sign of (n — n°1). 

In the case of annihilation of non-relativistic X-particles into relativistic particles, the 
normalized equilibrium distribution function is 


I e 


E 
aE 
Fe 
(P) = Get mT Ka)’ 


where E = \/p?+m* is particle energy and K2(«) is modified Bessel function. The 
integral in (9.37) is conveniently wtitten in terms of the Mandelstam variable s = (pi+p2)*. 
To this end, one replaces v- o by Lorentz-invariant quantity 


W (s) = 4E: Eav - o = 40\/ (pipe)? — m& = 20 (s)4/ s(s — 4m). 


Then one inserts unity 


- j dsô(s — (pı + p2)”) 


in the integrand and integrates over angular variables. The non-trivial integral over the 
angle between 3-momenta is evaluated by making use of the above 6-function. As a result 
one obtains 


E,+E2 


il Fi 
= i i 
(ov) [oT mk KEE) fiwa far dE2e 


The integral over energies is evaluated by introducing variables H+ = FE; + E2. Then 


E,+E 4m2 oo E 
f imame aa =2y1- T / dE, [E2 — me 
8 Jy 
[fm ee Js 
= 27 s — 4m} Kı (£ n 
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and finally 


_ 1 a L2 vs 
(ov) = aT mK (=) a ds W(s)\/s — 4m%, Kı ( T ) 3 


The Boltzmann equation (9.36) should be modified when applied to matter in expand- 
ing Universe. This modification takes into account the increase of the volume and redshift 
of momenta, and is given in (9.7), see also Sec. 5.4. 
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Let us discuss the possibility that new stable heavy particles make dark matter. 
Then the formula (9.27) is used to find their parameters (annihilation cross-section 
in the first place). The dark matter density today is Qpm œ~ 0.25. To estimate the 
annihilation cross-section we set co ~ 1/M?% in the argument of logarithm in (9.27), 
on dimensional grounds. Setting Mx = 100 GeV and gą = 100 for the estimate, we 
obtain the numerical value of the logarithmic factor in (9.27), 


* 


gx M;,Mxoo ] gx M>, 30 


l ~N — 
08 (On)372 °8 am )8/2Mx 


Since logarithm is a slowly varying function, this estimate is valid for wide range 
of masses Mx and cross-sections do. As an example, one finds the value of (ov) 
from Eq. (9.27) and obtains for Mx = 100 GeV 


Mé M ann 
ie ea) won: 


CE (9.38) 


The uncertainty in the parameter \/g.(ty) is also moderate: at T = 100GeV 
and T ~ 100MeV we have, respectively, \/g.(T) ~ 10, and \/g.(T) ~ 3 (see 
Appendix B). Thus, formula (9.27) gives the following estimate for the annihilation 
cross-section of cold dark matter candidates, 


1.8-10-!°. 20 
(3 = 10) -0.25- h? 


(oo ) Aa 


GeV~? = (0.3 + 1.0) - 1078 GeV~? 


= (1.1 + 3.7) - 107° cm’, (9.39) 


where the lower value refers to higher mass Mx, an is, therefore, more realistic. 
In the case of s-wave annihilation this estimate gives the parameter oo in (9.11). 
Notably, this value is comparable to weak interaction cross-sections at energies of 
order 100 GeV, namely ow ~ afy/Mĝ; ~ 1077 GeV-?. 

The result (9.39) has several important consequences. One is that it gives a 
cosmological lower bound on the annihilation cross-section of hypothetical heavy 
stable particles that may be predicted by extensions of the Standard Model. If the 
cross-section is below the value (9.39), mass density of these particles is unaccept- 
ably high. The main assumption behind this bound is that X-particles were in 
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Fig. 9.1. Annihilation of stable hypothetical X-particles into Standard Model particles fı and fo. 


thermal equilibrium at some early epoch. We note here that there is a bound on 
the annihilation cross-section 
o —_. A 

oS aR (9.40) 
In perturbative regime, this bound comes from the fact that the annihilation is 
described by diagrams like that shown in Fig. 9.1. For slow X-particles, the virtual 
particle Y has energy E = 2Ex = 2Mx in the center-of-mass frame. Its propa- 
gator gives the factor 1/M?% in the cross-section. Furthermore, there is additional 
suppression due to small coupling constant. 

The bound (9.40) may actually be violated in strongly coupled theories. This 
occurs when X-particle is a bound state of elementary particles, and the size of 
this bound state is large compared to the Compton wavelength. An example here 
is given by the proton. It is difficult, however, to imagine that the bound (9.40) is 
violated by many orders of magnitude. 

With this reservation, one makes use of (9.39) and (9.40) to obtain the cosmo- 
logical upper bound on the mass of stable particles, 


Mx < 100 TeV. (9.41) 


This bound is valid, if their interactions are sufficiently strong and the maximum 
temperature in the Universe exceeded My /20; see (9.18) and (9.38). 


Problem 9.4. Let us extend the Standard Model by adding new real scalar field 
X which interacts with the Englert-Brout—Higgs doublet H only. Let us add the 
following term to the Standard Model Lagrangian, 
i 2 
AL = 19, X0" X — ÉH'H X? —™ x2. 

2 2 2 
The discrete symmetry X — —X ensures the stability of the scalar particle X, so it 
is a dark matter candidate. Find the range of parameters (m, K) in which X -particles 
constitute all of dark matter. 


Let us mention that there is recent activity in discussing the possibility that all 
dark matter or its substantial part consists of much heavier particles. This scenario 
is realistic provided that these particles were never in thermal equilibrium and 


Sif My > Mx, then the propagator of Y-particles is suppressed by M;?, which makes the 
bound (9.40) even stronger. 
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were created in small number in the early Universe. We will briefly discuss possible 
mechanisms of superheavy particle creation in Sec. 9.8.1. 

By far more interesting is the possibility that X-particles are dark matter parti- 
cles. These dark matter candidates are called weakly interacting massive particles, 
WIMPs. The estimate (9.39) suggests the energy scale of X-particle interactions, 
Tg 1/2 10 TeV. In fact, this scale is somewhat lower, since the annihilation cross- 
section is suppressed by coupling constant in realistic models, which we denote 
as ax. Assuming that the energy scale of X-particle interactions does not much 
exceed their mass, we get 


a2 


oo N Me (9.42) 
As an example, for ax ~ 1/30 (W-boson coupling in the Standard Model) we obtain 


from (9.39) the following estimate: 
Mx ~ 200—400 GeV. (9.43) 


This estimate shows that there is a real chance to discover dark matter particles at 
colliders. The relevant processes at hadron colliders are quark—antiquark annihila- 
tion into dark matter particles plus visible Standard Model particles, and creation 
of dark matter particles in gluon—gluon collisions. The signature of dark matter is 
missing transverse energy (energy flowing transverse to the incident beam direc- 
tion), which is carried away by undetected dark matter particles. The diagrams of 
this sort are similar to that shown in Fig. 9.1, if the latter is looked at from right 
to left and emission of a Standard Model particle (say, gluon or photon) is added 
in one of the legs fi, fa. 

If intermediate paticles Y are much heavier than X, then low energy physics is 
described by effective contact interaction between dark matter and Standard Model 
particles, XX fı f2. The precise structure of this interaction is model dependent. 
Search for dark matter particles gives limits which can be interpreted as limits on 
elastic cross-sections of scattering of the dark matter particles off nucleon, if fı and 
f2 are gluons or first generation quarks. The diagrams for this scattering are again 
similar to Fig. 9.1, but now viewed from top to bottom (or from bottom to top). 
Figure 9.2 shows the results of the searches for dark matter interpreted in this way. 

It is worth noting that unlike the estimate (9.39) for the annihilation cross- 
section, our estimate (9.43) for the mass is very crude. Indeed, the dark matter 
particle may well be lighter or heavier than given in (9.43). Accurate calculations 
in concrete models show that the mass of the dark matter particle may be consider- 
ably smaller than 100 GeV, which makes collider searches even more promising. We 
also note that our estimate is particularly interesting from the viewpoint of super- 
symmetric extensions of the Standard Model which often predict the existence of a 
stable particle of mass in 100 GeV range; see Sec. 9.4.2. 

Thus, there are good reasons to expect that dark matter particles will be found 
in reasonably near future. Experimental determination of their properties, together 
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Fig. 9.2. Excluded regions in parameter plane (Mx,c@px) [162] for interactions independent of 
nucleon spin (left) and dependent on nucleon spin (right). Regions above the curves are excluded at 
90% C.L. CMS: search at Large hadron collider (under assumption of contact interaction X X fi f2, 
see the main text); IceCube and Super-K: search for a signal from dark matter annihilation in 
the Sun; other data: direct search for elastic scattering of cosmic dark matter in low background 
detectors. Shaded region in the central part of right figure corresponds to possible signal announced 
by CDMS experiment. See Fig. 13.16 for color version. 


with the precise calculation of their abundance and comparison with observational 
data will then provide a handle on the Universe at temperatures of order of tens 
GeV and age of 107°s. This is to be compared with the present knowledge: the 
Universe has been probed at BBN epoch, T ~ 1 MeV, t ~ 1s; see Chapter 8. 
Search for dark matter particles is underway, but no conclusive evidence has 
been obtained so far. Direct search for relic WIMPs is carried out in experiments 
trying to detect energy deposition in a detector caused by elastic scattering of a 
WIMP off a nucleus. Dark matter, like the usual matter, is more dense in galaxies; 
it is expected that its mass density near the Earth is similar to that of usual matter, 


GeV 
local 
~ 0.38 —. 
PDM m 
The velocity of dark matter particles is about vx ~ 0.5 -107 (orbital velocity 
around the Galactic center at the position of the Sun). Hence, the energy deposition 


in collision with a nucleus of mass M, is estimated as 
2 
AE ~ 1 2uxM,My, = 2M,v2 M,/Mx 
2M4 \ Ma+ Mx (1+ M,/Mx)? 


1 
(F M/M)? 


(9.44) 
=2M,v2 


The second of these formulas shows that the best choice of the target nucleus for 
given Mx is M, = Mx, while the third implies that for given detector material, the 
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energy deposition grows as My increases. Numerically, 


min(M,, Mx) ) ? 100 GeV T 


AE ~ 50. 
a ( 100 GeV M, 


We see that the energy deposition is of order of tens keV, which is very small. 
Nevertheless, there are methods to detect it. The event rate is 


V ~vxynx: Na+ Oax, 


where gax is the elastic X-nucleus cross-section, vx is the relative velocity, nx = 
pga! / Mx is the local number density of dark matter particles and N, is the number 
of nuclei in the detector. As an example, for elastic cross-section 04x ~ 10738 cm? 
and X-particle mass Mx = 100 GeV, the number of events in a detector of mass 


10 kg filled with nuclei of atomic number A = 100 is of order 


GeV 1 ) (e - 1073 GeV - 104 


~ 0.5-1073- | 0.3 = - — 
5 ( cm? ` 100 GeV 100 GeV 


) -10738 cm? ~ 3-1078 s7}, 


(9.45) 
which is about 1 event per year. The absence of a signal gives bounds in the 
space of model-independent parameters — dark matter mass Mx and dark-matter- 
nucleon elastic cross-section gyx. To compare different experiments, one uses, 
instead of the cross-section oax on nucleus A, the cross-section on a single nucleon 
Opx. It is important at this point that the cross-sections aax is not equal to 
the sum of cross-sections on individual nucleons: for slow dark matter particle 
there is a coherence effect that increases the cross-section by a factor of order A. 
More precisely, for interaction independent of the nucleon spin there is a relation 
Opx = (oax /A*)(ms/m4)(mx + ma)?/(mx + mp)? (modulo nuclear form-factor 
that dumps the cross-section at large A), where the additional mass-dependent 
factor accounts for the fact that the eleastic cross-section is proprotional to the 
squared momentum transfer, or (for the nonrelativistic scattering) the energy depo- 
sition (9.44). Another important point is that the elastic cross-section may depend 
on nucleon and nucleus spin. Examples of limits in the parameter plane (Mx, 0px) 
are shown in Fig. 9.2. 


Problem 9.5. Find the density of relic massive neutrinos of the hypothetical fourth 
generation of the Standard Model, assuming that their main annihilation channel is 
s-wave annihilation through virtual Z-boson, and that their mass obeys m, > Mz/2. 
Estimate the elastic cross-section of scattering of these neutrinos off nuclei and find 
the limit on their mass from the data shown in Fig. 9.2. 


Besides the direct search, there are ways to search for dark matter particles 
indirectly. These include search for products of WIMP annihilation at present time. 
In this case too, no compelling dark matter signal has been observed so far. 

One of the promising indirect ways is to search for monoenergetic photons which 
are emitted in two-body annihilation processes XX — yy, XX — Zy. In this case 
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Fig. 9.3. Exclusion regions in the plane (Mx, o%¥%) [318] for dark matter particles annihilating 
through indicated channels. The light gray line is the cross-section at freeze-out in the early 
Universe. Regions above the lines are ruled out at 90% C.L. AMS-02: search for antiparticles; Fermi 
LAT: search for photons from dark matter annihilation to ut u~ and rtr (upper and lower lines), 
WMAP?7: limit from possible annihilation into charged lepton pair before recombination (upper 
range rtr”, lower range ete~ ), which would affect CMB. As an example, the uncertainty for the 
~ is shown as shadow region. See Fig. 13.17 for color version. 


channel ete 
the signal should be enhanced in the direction towards dense regions in and outside 
the Galaxy, such as Galactic center, Magellanic Clouds, Andromeda galaxy, etc. 
Also, charged particles produced in dark matter annihilation may emit photons in 
their own annihilation, via Compton scattering and synchrotron radiation. Another 
way is to detect antiparticles created in the annihilation: positrons and antiprotons. 

Figure 9.3 shows limits of the dark matter annihilation cross-section for different 
channels. We see that these channels cannot dominate at dark matter freeze-out in 
the early Universe for Mx > 100 GeV. We note, however, that the bounds actually 
depend on the assumptions on the dark matter distribution in galaxies, since the 
annihilation rate is proportional to number density squared. In particular, the signal 
is enhanced, if dark matter forms small clumps inside galaxies. 


Problem 9.6. Prove the last statement. 


The annihilation cross-section of dark matter particles may be quite different from the 
cross-section responsible for freeze-out in the early Universe. This happens, in particular, 
if the cross-section strongly depends on velocity of annihilating particles: velocities of dark 
matter particles in halos v, are lower by one-three orders of magnitude as compared to their 
velocities vz at freeze-out. If annihilation occurs from a state of anglar momentum /, then 
the annihilation rate in galaxies is suppressed by a factor (vp, /vx)?’ <1. On the contrary, 
the annihilation in galaxies can be enhanced by a light particle coupled to the dark matter 
particles. This particle induces the interaction between the two dark matter particles 
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in t-channel, which attracts one dark matter particle to another. Then the annihilation 
proceeds in two stages: at the first stage two dark matter particles approach each other 
forming a kind of bound state; at the next stage this state annihilates in s-channel (or in 
t-channel by exchange of a heavy particle) into a pair of light (Standard Model) particles. 
The annihilation cross-section of this process gets significant amplification at the first 
stage, where the mechanism similar to the Coloumb enchancement works as discussed 
in Section 9.2. For slow dark matter particles, vn < a’, where a’ is the analog of the 
fine-structure constant of new interaction, the amplification factor is ta‘/v, >> 1 for 
annihilation from s-channel initial state and behaves like œ 1/v;'*! for the initial state 
with angular momentum l. 


Heavy dark matter particles can also concentrate in the centers of the Earth and 
the Sun. This is possible if the interactions of dark matter particles with usual mat- 
ter is strong enough, so that these particles occasionally lose energy when passing 
through the Earth or Sun, and eventually get caught by them. As a result of the 
high concentration, the dark matter annihilation may be strongly enhanced. The 
annihilation often leads to production of high energy neutrinos, an example being 
XX — pv. These neutrinos propagate away without colliding with matter, and 
can be detected by neutrino detectors. Search for high energy neutrinos from the 
centers of the Earth and the Sun is being performed at underground, underwater 
and under-ice detectors, “neutrino telescopes”. Limits from these indirect searches 
are shown in Figure 9.4. 
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Fig. 9.4. Exclusion regions in the parameter plane (Mx,opx) from the data of neutrino tele- 
scopes [163] on dark matter annihilation in the center of the Sun. The annihilation channels are 
shown in the right part. Regions above the curves are ruled out at 90% C.L. These limits are 
obtained under the assumption of dynamical equilibrium between the capture of dark matter 
particles by the Sun and their annihilation inside the Sun. 
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Problem 9.7. Let us assume that there exist stable, electrically charged parti- 


cles X= much heavier than proton. Let us recall that baryons in the early Uni- 
verse are predominantly either protons or a-particles (“He nuclei). Mass fraction of 
a-particles is 25%. 


(1) Find the binding energy of an “atom” consisting of X~-particle and proton. 
The same for X-a atom. 

(2) Assuming that the number density of X~-particles is small compared to that 
of baryons, find out in which state they predominantly exist today: bound state 
with proton, bound state with a-particle or free state. 


(3) Assuming that X+-particles were in thermal equilibrium in the early Universe, 
and that the asymmetry between XT and X* equals nx, find the present mass 
density of relic X -particles as function of their mass Mx and the asymmetry nyx. 

(4) Place the bounds on the asymmetry nx making use of the fact that searches 
for anomalous heavy isotopes (“wild hydrogen” and “wild helium”) set upper 
bounds on the present mass density of these isotopes at the level [123, 124] 
Qx < 10710 . (10 TeV/Mx) at Mx < 10 TeV (see also [1, 124] for other mass 
intervals). 


9.4. *Other Applications of the Results of Section 9.2 


Before considering concrete particle physics models with dark matter candidates, 
let us give two examples of the use of the results of Sec. 9.2. 
9.4.1. Residual baryon density in baryon-symmetric Universe 


Let us find baryon (proton and neutron) number density after baryon—antibaryon 
annihilation in a toy Universe having no baryon asymmetry. In such a Universe, 
densities of baryons and antibaryons are equal to each other. We recall the baryon- 
antibaryon annihilation cross-section at low energies? 


oo © (100 MeV) ~?. (9.46) 
Using this value in (9.18), we find for freeze-out temperature 
Tp ~ 20 MeV, (9.47) 


where we calculated M$, with gẹ = 43/4 (photons, electrons, positrons and three 
neutrino species). Formula (9.27) gives the mass density of baryons in baryon- 
symmetric Universe, 


Qe 5-107. 


°Note that the cross-section (9.46) exceeds by an order of magnitude the bound (9.40). The reason 
is that baryons are bound states of quarks and gluons. 
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This is about nine orders of magnitude smaller than the baryon density in real 
Universe. Substantial amount of baryons in our Universe exists entirely due to the 
baryon asymmetry. 


9.4.2. Heavy neutrino 


As another example, let us obtain a cosmological bound on the mass of hypothetical 
heavy neutrino of the fourth generation. In the absence of mixing with other neu- 
trinos, such a neutrino does not experience charged current interactions with usual 
leptons, but participates in neutral current interactions. Let us assume that the 
heavy neutrino mass is not very large, my < Mz w. The analysis in Sec. 7 applies 
to light neutrino only, for which Tf >> m,. We consider here the opposite case 
in which neutrino annihilates being non-relativistic. In this case the annihilation 
cross-section is estimated as 


To ~ G2m?; (9.48) 


it does not depend on temperature. Using this cross-section in (9.27) we obtain the 
estimate for the present mass density of heavy neutrinos, 


2 
Q, = 0.55 (=) (9.49) 


where we set m, = 3GeV in the argument of logarithm and used the following 
value for the effective number of degrees of freedom, 


T 3 
gx =2: (1+8) +g: (2:4+2-3+3-4-3)=617; 


(photons, e*, *, three neutrino species, gluons and quarks!? u, d and s). 

We see that the existence of heavy stable neutrino of mass my < 3 GeV would 
be inconsistent with cosmology [125]. For a long time this had been the strongest 
bound on the mass of heavy neutrino. Present bound comes from the collider data 


on the Z-boson decay into undetected particles and reads m, > M,/2. 


Problem 9.8. Check that neutrino of mass m, = 3 GeV indeed freezes out being 
non-relativistic. Find the values of the neutrino mass at which this property no 
longer holds. Show, nevertheless, that cosmologically excluded is the entire mass 
range of stable neutrino 


20eVS m, $3 GeV, 


where the lower value has been obtained in Chapter 7, formula (7.10). 


10For my > 20GeV, freeze-out temperature exceeds 1GeV, and g+ receives contributions from 
t-lepton and heavy quarks. This modifies the estimate (9.49) only slightly. 
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9.5. Dark Matter Candidates in Particle Physics 


We discuss in the following Sections some popular models of particle physics contain- 
ing dark matter particle candidates.!! There are no such candidates in the Standard 
Model, so the cosmological data on dark matter require that the Standard Model 
must be extended. This fact is very important for particle physics. 

Among the candidates we discuss, the natural one is neutralino. It is automati- 
cally stable in many viable supersymmetric extensions of the Standard Model and 
its present abundance is automatically in the right ballpark. Other candidates are 
less natural in the sence that their present mass density may a priori vary within 
large margin, and the correct value is obtained by adjusting the parameters of a 
model. 

From the viewpoint of direct experimental searches, all candidates can be clas- 
sified by using two parameters only, the particle mass Mx and its elastic scattering 
cross section!” off nuclei o;,,,,. These parameters are shown in Fig. 9.5 for most 
popular candidates [164]. It is clear that the candidates have very diverse prop- 
erties. Search for candidates from different regions in this plot is performed, as a 
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Fig. 9.5. Parameter ranges for various dark metter candidates [164]: their masses Mx and elastic 
scattering cross-sections off nucleon Cint; 1pb = 10736 cm? (conversion cross-sections for axion 
and axino). Red, rose, blue and dark blue show hot, warm, cold and ultracold dark matter, respec- 
tively. See Fig. 13.18 for color version. 


11One of the candidates, massive sterile neutrino, is considered in Sec. 7.3. 
12Tn the case of axion and axino, the relevant parameter is the cross-section of conversion on 
nucleon into other particles, since this cross-section exceeds considerably the elastic cross-section. 
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rule, by making use of different detection techniques. In some cases, in particular, 
for very heavy and/or very weakly interacting particles (wimpzilla and gravitino, 
respectively), no realistic detection methods are known. 

The zoo of proposed dark matter candidates is huge. In what follows we do not 
intend to discuss even a sizeable fraction of inhabitants of this zoo. We leave aside 
numerous candidates which we think are rather exotic. These include primordial 
black holes, strongly interacting dark matter, axino (superpartner of axion), mirror 
matter and many others. 


9.6. *Stable Particles in Supersymmetric Models 


Supersymmetry (SUSY) is the symmetry between bosons and fermions. The sim- 
plest (so called N = 1) SUSY theories in (3 + 1) dimensions contain both particles 
and their superpartners, particles of opposite statistics whose spin differs by 1/2 
from that of particles. Superpartners participate in the same interactions as par- 
ticles. In other words, particles and their superpartners have the same quantum 
numbers with respect to gauge group of the theory, while coupling constants of 
other interactions (e.g., Yukawa) are related in a well defined way. Each vector 
particle (e.g., gluon) has spin-1/2 superpartner (gluino), while each fermion (e.g., 
quark) has scalar superpartner (squark). More precisely, the number of spin degrees 
of freedom must be the same, so there are two scalar particles (two states) per each 
quark (two spin states). 


Let us illustrate these properties using the supersymmetric generalization of Quantum 
Electrodynamics (QED). In fact, this was the first model illustrating the theoretically dis- 
covered supersymmetry [127]. QED itself is a theory of massive Dirac fermion 4% (electron) 
interacting with Abelian gauge field A, (photon). The QED Lagrangian is 


1 we, oe . = 
LQED = — yh + ivy" (On + ieda) — mow. 


SUSY QED also contains four scalar degrees of freedom (corresponding to four spin states 
of electron and positron), which are described by two complex scalar fields ¢4 and ¢_. 
These carry charges +1 and —1 of the gauge group U(1). There is an obvious contribution 
to the Lagrangian, 


Li = Dud. D" p+ — m4 pipt + Dud) D" G- — m2 go, 
Dup = (On F ieAy)os, 


where m+ and m- are masses of scalars. In the case of unbroken supersymmetry, these 
masses are equal to the electron mass, m+ = m- = m. (This is not the case if super- 
symmetry is spontaneously broken, see below.) There is also photino, a superpartner of 
photon. This is electrically neutral Majorana fermion (two spin degrees of freedom), which 
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can be described by left spinor Àz. Its free Lagrangian is 
Lo = iry" OuAr.- 


Besides the gauge interactions between the gauge field A, and charged fields, there are 
“supersymmetry completions”, the gauge-invariant Yukawa interaction between photino, 
electron and scalars, 


La =iV2er,w- bd —iv2edrAL -p4 + h.c., 


and self-interaction in the scalar sector, 


e * * 2 
La =- 0o - 61.6). 


The coupling constants of these additional interactions are related to the gauge coupling 
in a unique way. Unambiguous relations between various couplings is a general property 
of SUSY Lagrangians. 

N = 1 supersymmetric extensions of other (3 + 1)-dimensional field theories have sim- 
ilar properties. In particular, supersymmetry completion of the usual Yukawa interaction 
is interaction between scalars, and vice versa. 


If supersymmetry is unbroken, masses of particles are equal to masses of their 
superpartners. This makes unbroken supersymmetry phenomenologically unaccept- 
able: no superpartners of the Standard Model particles have been discovered so 
far. Hence, SUSY extensions of the Standard Model assume that supersymmetry 
is spontaneously broken. In that case masses of superpartners may be arbitrary. 
The negative results of numerous searches place lower bounds on these masses [1]. 
These bounds are roughly at the level Mg = 100 GeV + 1.5 TeV, depending on 
superpartner. On the other hand, there are hints from the theory! that the super- 
partner mass range is 30 GeV-3 TeV. In this regard, search for superpartners is one 
of important directions in high energy physics, with high expectations raised by the 
proton-proton collider LHC at CERN of the nominal total center-of-mass energy 
14 TeV. 

SUSY extensions of the Standard Model give rise, generally speaking, to pro- 
cesses with baryon and/or lepton number violation. No processes of this sort have 
been observed so far, and there are severe bounds on their rates. (The strongest 
one comes from non-observation of proton decay.) To get rid of these unwanted 
processes, one usually imposes a symmetry called R-parity which forbids baryon 
and lepton number violation at low energies. 

R-parity is a discrete symmetry, each particle can be either even or odd under 
it (positive and negative R-parity, respectively); R-parity of a state with several 
particles is a product of R-parities of all these particles. All Standard Model parti- 
cles (and also additional Higgs bosons required by SUSY) are R-even, while their 
superpartners are R-odd. Once all interactions conserve R-parity, superpartners are 


13The most important hint is the partial cancellation of quadratic corrections to the Higgs boson 
mass, which provides the solution, in a technical sense, of the gauge hierarchy problem (Why 
Mw <« Mp;?). Another hint is the gauge coupling unification. 
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produced and disappear in pairs; in general, a pair consists of different superpart- 
ners. R-parity conservation thus implies that the lightest superpartner — LSP — 
is stable. This is the new particle that serves as the dark matter candidate. Since 
electrically charged particles of mass 30 GeV-10 TeV cannot be dark matter (see 
Problem 9.7), the potential candidates in SUSY theories are neutralino, sneutrino 
and gravitino.'+ Let us discuss them in turn. 


9.6.1. Neutralino 


A popular dark matter candidate is the neutralino. In fact, most direct searches for 
dark matter are oriented towards WIMPs, while the neutralino belongs precisely 
to this category. There are three reasons for the popularity. The first is that the 
neutralino is predicted by SUSY extensions of the Standard Model and the lightest 
neutralino is a stable LSP in a wide range of parameters of the theory. Second, like 
for any WIMP, the neutralino mass density in the Universe is more or less automat- 
ically in the right ballpark. Third, the neutralino participates in weak interactions, 
so its elastic scattering cross-section, albeit small, may be sufficient for its direct or 
indirect detection in dark matter searches (see Fig. 9.5). 

Neutralino is a term used for electrically neutral Majorana fermion which is a 
linear combination of superpartners of Z-boson, photon and neutral Higgs bosons. 
There are four such combinations in the minimal SUSY extensions of the Standard 
Model. The point is that these theories necessarily contain two Higgs doublets, so 
there are two neutral higgsinos, superpartners of the Higgs bosons. The other two 
neutral fermions, photino and zino, are collectively called neutral gaugino. 

Like their Standard Model partners, neutralinos participate in gauge interac- 
tions. Provided that the lightest neutralino is LSP, its present abundance is crudely 
estimated along the lines of Sec. 9.2. We set gx = 2, g4(tf) ~ 100, and use the 
estimate (9.42) for the annihilation cross-section. Then the formula (9.27) gives 


T; ~ 9.50 
Pay) (9.50) 
10-3 / M ? 100 GeV 
Oy = 3-1074 [| et 1012. = 9.51 
EENM a (er) og( My ven) 

M 2 
x 0.8-1077- {| —~— 9.52 
von (ac): (152 


where ay œ 1/30 is the gauge coupling of SU (2)w interaction. For the neutralino 
in the interesting mass range 


100 GeV < My < 3TeV, 


14We restrict ourselves to minimal extensions of the Standard Model and leave aside other dark 
matter candidates like axino, singlino, etc. 
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our preliminary estimate is 
0.01 S$ Qn S10, 


which is indeed in the right ballpark. 

We are going to refine the estimate (9.52) in this Section. We will make a simpli- 
fying assumption that the neutralino is the only new particle present in the cosmic 
plasma at relevant temperatures. This is the case if the masses of other superpart- 
ners are sufficiently larger than the neutralino mass. In this case the only processes 
of interest to us are neutralino pair creation and annihilation. We note, however, 
that our assumption is not innocent. If other superpartners have masses close to 
that of neutralino, they also exist in the plasma at the epoch of neutralino freeze- 
out, and freeze-out occurs in a more complicated way: neutralino are created and 
annihilate in pairs with other superpartners, with possible resonant enhancement of 
some of these processes; also, neutralinos are created in decays of superpartners, etc. 
We will see that in some versions of the theory the latter phenomena are important 
and, in fact, they help to obtain the right neutralino abundance. 

Still, let us proceed under our simplifying assumption. Since the neutralino is 
an intrinsically neutral particle, i.e., it is its own antiparticle, the factor 2 in the 
first of Eq. (9.26) is absent. Therefore, the contribution of relic neutralinos into the 
present energy density is 


1 Gev~? 
Op x 0.9107 2 —_ Ml (9.53) 
Lpr/Gx To + 3012 fF 
We recall that the parameter x = T;/Mvw is the solution to Eq. (9.17), i.e., 
3-710 
az? = ln | —— an -Myn - Mp, - (oo + 6012 f) x (9.54) 


ST? /GnkF 


To obtain quantitative estimates, we have to find the parameters oo and g1 in terms 
of the parameters entering the action of the SUSY model. This is done in different 
ways for different neutralino masses. 

Our first example is the light neutralino, My < Mz, and we begin with very 
low masses, My « Mz. The region in parameter space where this option is viable 
is very restricted. Indeed, the kinematically allowed decay Z — NWN should not 
give too large a contribution into the well measured invisible Z-decay width. Hence, 
the lightest neutralino in this case must be predominantly bino (superpartner of 
the gauge boson of the weak hypercharge group U(1),-), with small admixture € of 
higgsino and the other neutral gaugino. 

One possibility is that the dominant annihilation channel for the very light neu- 
tralino is the s-channel annihilation into two b-quarks through exchange of virtual 
scalar or pseudoscalar particle A; see Fig. 9.6. In the minimal SUSY extensions of 
the Standard Model, A is parity-odd Higgs boson A. The annihilation cross-section 
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Fig. 9.6. Light neutralino annihilation. 


is estimated as follows, 


aM? 
(ov) ~ gM, 


A 


where y is the Yukawa coupling of b-quarks with the boson A. Sufficiently large 
cross-section is obtained for large yẹ ~ 1. In that case 


2 2 4 
E o (EN (_Mx \? (100GeV 7 
= oD (£) (ey) ( ma a 


We estimate the freeze-out temperature from (9.54) and obtain xf ~ 1/20. The 
neutralino mass density today is then found from (9.53). Numerically 


0.1\ /10GeV\* ma \4 
Qs ~ 0.2: (=) (=) (mx) . (9.55) 
We see that a very light neutralino of mass Mẹ ~ 10 GeV can indeed be a dark 
matter particle, provided that m4 ~ 100 GeV and €? ~ 0.1. 

This mechansim is problematic in the minimal SUSY extensions of the Standard 
Model. The lower limits on the mass of the axial Higgs boson A are in the range of 
several hundred GeV, depending on other parameters. So, the mechanism we discuss 
is typically insufficient for annihilating enough neutralinos in the early Universe. 
This mechanism does work, however, in models with extended Higgs sector (e.g., 
with extra scalar field which is a singlet under the Standard Model gauge group): 
these models contain more (pseudo)-scalar particles, some of which may be light 
enough. 

Let us continue with the case My < Mz/2. Another neutralino annihila- 
tion mechanism exists in models with light superpartners of the Standard Model 
fermions, Ms < Mz: this is the annihilation through t-chanel exchange of these 
superpartners. With Ms ~ Mz, the estimate of neutralino abundance is similar to 
(9.55) where one replaces (fy)? —> a and ma — Ms ~ Mz. Hence, neutralino of 
mass My ~ 30 GeV is indeed a viable dark matter candidate. The relevant super- 
partner S in realistic models is stau (superpartner of tau-lepton), and the dark 
matter neutralino is a mixture of bino and higgsino. 

Data from the Large Hadron Collider (LHC) suggest, however, that realistic 
SUSY models have mass hierarchy in the Englert-Brout—Higgs sector: one Hoggs 
boson has mass 125 GeV and properties close to those of the Standard Model 
Higgs boson, while others, including A, have much higher masses. Also, masses of 
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Fig. 9.7. Neutralino annihilation into the Standard Model fermions. 


superpartners of the Standard Model fermions are much greater than Mz. In that 
case very light neutralinos would be overproduced, as we see from Eq. (9.55). So, 
let us consider the case of heavy Higgs bosons and neutralino of intermediate mass 
Mz/2< My X Mz. The annihilation cross-section in this case is often suppressed 


as compared to the estimate leading to (9.51). Indeed, the neutralino in our case 
predominantly annihilates into the Standard Model fermions, 


NN > ff. (9.56) 


These processes occur via the s-channel Z-boson exchange, and also possibly via t- 
channel exchange of superpartners of the fermions; see Fig. 9.7. Since the neutralino 
is non-relativistic and its mass is smaller than that of the Z-boson, momenta of 
virtual particles in these diagrams are small compared to their masses, so they 
can be neglected. This means that the annihilation process (9.56) is described by 
effective four-fermion interaction 


=Y NPN: Falar + rrii, (9.57) 
7 


where the parameters aş and by have dimension GeV~?. These parameters are 
model-dependent. (In the case of Z-boson exchange they depend on the mixing 
matrix in the neutralino sector.) An order of magnitude estimate for the Z-boson 
exchange contribution is 


ar, by ~ Gr =1-10°-°GeV ®. (9.58) 


Superpartners of fermions are typically heavier than Z-boson, so the t-channel 
exchange diagrams give smaller contributions. 

The interaction (9.57) gives for the non-relativistic annihilation cross-section 
(recall that neutralino is Majorana fermion) to the leading orders in v? and m4 


oy 1 2 2 1 2 2 2 2 


where v is the relative velocity of the two incident neutralinos. This means that the 
parameters co and cı defined in (9.12) are equal to 


DD a= g De? 2 EJM. 
f 
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The s-wave annihilation is suppressed by the fermion mass: we see that co — 0 as 
my — 0. This is due to the following property. In the limit of massless fermions, 
only left-handed Standard Model fermions and right-handed antifermions interact 
with Z-boson. The projection of the angular momentum of such a pair onto their 
flight direction z is either J, = 1 or J, = —1. Non-relativistic spin-1/2 Majorana 
neutralinos should be in the state of angular momentum 1 (p-wave) to produce 


a pair with J, = £1: the state of angular momentum 0 (s-wave) and total spin 
J, = +1 (spins directed in the same way) is forbidden by the Pauli principle. 

Making use of the above estimates for the parameters ag and a, we find 
from (9.53) that 


Qy ~0.2 for My œ 60GeV. 


So, relatively light neutralinos are reasonable dark matter candidates. 

The problem with these scenarios, however, is that in concrete models, the neu- 
tralino mass is related to masses of other superpartners. The latter are strongly 
constrained by experiment, so in many models neutralino cannot be light. 

Thus, let us turn to the heavier neutralino. In that case, momenta in internal 
lines of the annihilations diagrams (e.g., diagrams shown in Fig. 9.7) are not small, 
Q? = 4Mz, so the cross-sections are generically suppressed by a factor of order 
M#/4M3% as compared to the case of light neutralino. With an account of p-wave 
suppression of annihilation into fermions this means that the annihilation cross- 
section is generically too small, i.e., heavy neutralinos are typically overproduced 
in the Universe. 

Heavy neutralinos, My Z Mz, still remain dark matter candidates in a certain 
(typically narrow) ranges of parameters of concrete models. The annihilation may 
be enhanced, e.g., by the presence of other superpartners in the cosmic plasma 
at neutralino freeze-out. This occurs when these superpartners are approximately 
degenerate in mass with neutralino. Whether this is a natural solution to the neu- 
tralino overproduction problem, is another issue. 


Problem 9.9. Consider minimal SUSY extension of the Standard Model in which 
all superpartners, except for sleptons and photino, are very heavy. Let the masses 
of sleptons and photino be of order 100 GeV, and photino is LSP. Neglecting slepton 
mixing, find the relic photino abundance. Assuming that one, two or three sleptons 
are light and have one and the same mass, find the range of slepton and photino 
masses in which photino is dark matter. Hint: The Lagrangian of photino interaction 
with leptons and sleptons is given in the beginning of Sec. 9.6. 


Even the Minimal Supersymmetric Standard Model (MSSM), which is a min- 
imal SUSY extension, contains many free parameters, in addition to those of the 
Standard Model. Fairly large parameter regions are already ruled out by experi- 
ments at the LHC and elsewhere. Yet there are parameter regions where neutralino 
serves as dark matter particle, WIMP. In most of these regions the elastic cross 
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Fig. 9.8. Predictions of MSSM for spin-independent cross-section of elastic scattering of neu- 
tralino off nucleons [165]. Color regions correspond to parameters consistent with accelerator data 
and yielding Qy =% 0.25. The blue region is ruled out by direct search experiment XENON 100. 
The neutralino annihilation cross-section is enhanced in green regions by resonant annihilation 
through Z-boson or one of the Higgs bosons of MSSM, when My ® Mz/2 or Mn % mq/2; 
see discussion below. The suppression of neutralino abundance in yellow and magenta regions is 
due to approximate degeneracy of neutralino and some other superpartner leading to efficient co- 
annihillation of the neutralino with this superpartner. Other mechansims that suppress neutralino 
abundance operate in gray regions; they require fine tuning of parameters too. Shown here are 
also the regions of possible signals in DAMA, CoGENT, CRESST and CDMS experiments and 
planned sensitivity of LUX and XENON 1T. See Fig. 13.19 for color version. 


sections of neutralino scattering off nuclei is high enough, so that cosmic neutralino 
can be detected in direct experimental searches, which we talked about in Sec. 9.3, 
see Fig. 9.2. This is further illustrated in Fig. 9.8. 


Let us describe a consistent and phenomenologically viable example with rather few new 
parameters, called mSUGRA (minimal Supergravity). The model assumes that supersym- 
metry is broken in some “hidden” sector, and this breaking is transferred to the Standard 
Model fields and their superpartners by gravitational interactions (for a review see, e.g., 
Ref. [129]). Then at certain energy scale M all scalars of the visible sector, irrespective of 
their quantum numbers, obtain one and the same SUSY breaking mass mo, while all gaug- 
ino (superpartners of gauge bosons) obtain one and the same SUSY breaking mass Mı/2. 
Also, trilinear interactions between the Higgs doublets and other scalars are generated; 
these are assumed to be proportional to the Yukawa couplings with one and the same pro- 
portionality coefficient A. These mass terms and trilinear couplings break supersymmetry. 
At the scale M this breaking is therefore universal. From the viewpoint of phenomenol- 
ogy, the advantage of this universality is that dangerous flavor changing neutral current 
interactions are suppressed. 

As we mentioned above, there are two Higgs doublets; one of them, Hy, interacts with 
up quarks while the other, Hp interacts with down quarks and charged leptons. They both 
obtain vacuum expectation values, so that there is one more parameter, 


(Hv) 
(Hp) 


tan 8 = 
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Finally, there is yet another, discrete parameter, which is called sign of u for technical 
reasons. !° 

In the simplest, and yet realistic models, the mass scale M is set equal to the Grand 
Unification scale Meur ~ 101° GeV [131]. At this scale all three gauge couplings of the 
Standard Model are equal (modulo normalization factor for weak hypercharge). We discuss 
Grand Unified Theories in Sec. 11.2.2, and here we only notice that this scale is very high. 

The class of models just described is often called constrained Minimal Supersymmetric 
Standard Model (CMSSM), while the term mSUGRA is reserved for even more constrained 
model (where, in particular, tan 8 is not a free parameter, and gravitino mass equals mo). 

Quantum corrections modify all mass parameters {mo;:, Ajj, A, Mi} and coupling con- 
stants {ai, Y;j,Y;7 } as the energy scale Q changes from M to Mz. This is described by 
renormalization group. At the scale Mz supersymmetry breaking is no longer universal, 
but all new (with respect to the Standard Model) parameters are expressed through M, 
mo, Mı/2, A, tan 8 and sign p. 

The renormalization group evolution in mSUGRA model [130] is shown in Fig. 9.9. 
Due to large color SU(3) gauge coupling, masses of squarks and gluinos run faster than 
other masses. The lightest among gauginos turns out to be bino. The situation is more 
complicated in the scalar sector, where the gauge and Yukawa interactions compete. As a 
result, the lightest slepton is a superpartner of T. 

A nice feature of mSUGRA model is that the electroweak symmetry breaking occurs 
automatically, due to quantum corrections. As Q decreases, one of the Higgs masses 
squared becomes negative, which means spontaneous symmetry breaking. This is shown 
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Fig. 9.9. An example of the renormalization group running of the mass parameters in mSUGRA 
model (plot by T. Falk from [130]). The universal parameters at the scale Meur ~% 1016 GeV are: 
Mıj2 = 250 GeV, mo = 100 GeV, A = 0, tan @ = 3 and sign u = —1. 


15Tn fact, the parameter u, the supersymmetric mass of the Higgs doublets, may be complex. This 
property serves as a potential source of C’P-violation in the Higgs sector. This is quite interesting 
from the viewpoint of the baryon asymmetry generation; see Chapter 11. 
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mSUGRA: A,= -m,,,, tanB=30, u > 0, m=173 GeV 
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Fig. 9.10. Experimentally forbidden and cosmologically favored regions in the plane (My ;/2, mo) 
in mSUGRA model [166] with Ao = mo and p > 0 (left) and CMSSM [167] with Ap = —my/2, 
tang = 30 and pu > 0 (right). Shown here are lines of constant mass of the lightest Higgs boson 
(realistic interval is 125-126 GeV. In the left panel, the elongated blue region is consistent with 
neutralino dark matter and has m 1/2 % 1300 GeV, mo ~ 850 GeV; the region above the blue strip 
and brown region below are cosmologically ruled out (neuralinos are overproduced and LSP is a 
charged superpartner of tau-lepton, respectively); below the brown region LSP is gravitino (see 
disussion in Sec. 9.6.3), the region to the left of solid green line is excluded by the LHC data; gray 
lines show the values of tan 8. In the right panel, the cosmologically allowed region is shown in red, 
in the black region where m 1/2 © 1800 GeV, mo © 7000 GeV neutralino abundance coincides with 
that of dark matter; in the green region LSP is the electrically charged superpartner of t-quark. 
See Fig. 13.20 for color version. 


in Fig. 9.9, where the mass of the Higgs field is given by the combination ym + p?. Since 
the renormalization group running is logarithmic in Q, electroweak symmetry breaking 
occurs many orders of magnitude below Meur for small u and SUSY breaking terms, 
which explains the hierarchy Mw < Meur. 

Given the tight experimental constraints on the parameters of mSUGRA, there are only 
three narrow regions in the parameter space where relic neutralino has right abundance to 
serve as dark matter; see Figs. 9.10 and 9.11. The reason is that the region in parameter 
space where neutralino is light is experimentally excluded. So, the light neutralino scenar- 
ios we discussed above are impossible. On the other hand, heavy neutralinos tend to be 
overproduced in the early Universe. The cosmologically favored regions in Fig. 9.10 are 
special in the sense that the neutralino annihilation cross section is considerably enhanced 
in one or another way. This way of obtaining the correct dark matter abundance does not 
appear particularly plausible, although the high-energy values of the parameters are not 
particularly contrived. 

One mechanism of enhancement of the neutralino annihilation cross-section exists for 
large tan 8 only. It occurs when the masses of scalar or pseudoscalar Higgs bosons (which 
are automatically almost equal in the relevant parameter region) are close to the mass of 
a pair of neutralinos 2My ~% my or 2mn ~% ma. Then the decay channel 


NN — A*, H* — Standard Model particles, 


has resonance enhancement. Of couse this occurs due to tuning of parameters. 
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Fig. 9.11. Same as in Fig. 9.10, left, but for model [166] with an additional parameter in the 
Higgs sector as compared to CMSSM. The magenta region in the left upper corner is ruled out 
by the requirement of spontaneous breaking of the electroweak symmetry. The neutralino is dark 
matter particle in the blue region. See Fig. 13.21 for color version. 


Two other mechanisms exist in models with co-LSP, where the next-to-lightest super- 
partner, NLSP, is almost degenerate in mass with neutralino LSP. In one case (strip in 
Fig. 9.10, left panel, along the cosmologically disfavored region with charged LSP) NLSP 
is the charged slepton (mostly right stau), while in the other case (narrow strip in Fig. 9.11 
along the region without electroweak symmetry breaking) it is the lightest superpartner 
in the quark sector, the superpartner of t-quark. 

Due to approximate degeneracy in masses, the freeze-out epoch is one and the same 
for reactions 


LSP + LSP — Standard Model particles, 


NLSP + LSP — Standard Model particles, 
NLSP + NLSP — Standard Model particles. 


The larger number of channels, and also resonant enhancement of ¢-channel annihilation 
reduces the freeze-out abundance of neutralinos. The extra channels are indeed important 
in a narrow region of the parameter space where 


MNLSP — MLSP Am 


$0.1. (9.60) 
MLSP MLSP 
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The latter estimate follows from the fact that NLSP density is suppressed with respect to 
LSP by the Boltzmann factor 


Am _99.—Am_.( ™isp/20 
Tf =e ™LSP Ty 
bi 


This factor is less than 15%, unless the inequality (9.60) holds. 


To conclude this Section we mention that in some models neutralino is long 
lived but not absolutely stable. The stable particles — dark matter candidates — 
may then be produced in neutralino decays. This helps to solve the overproduction 
problem, as the number of dark matter particles is the same as the freeze-out number 
of neutralinos, while the mass of the dark matter particles is smaller. We discuss 
this possibility in the context of gravitino dark matter, Sec. 9.6.3, since gravitino is 
the most interesting and realistic partner of neutralino in the pair LSP-NLSP. 


9.6.2. Sneutrino 


The sneutrino is a superpartner of the neutrino. SUSY models contain three gen- 
erations of sneutrinos described by complex, electrically neutral scalar fields. If the 
lightest sneutrino is LSP, one might wonder whether it can be a dark matter candi- 
date. Indeed, weak interactions of sneutrinos are the same as those of neutrinos, so 
the order of magnitude estimate of the present sneutrino mass density would coin- 
cide with (9.27). (Unlike in the case of the neutralino, there is no p-wave suppression 
of the annihilation cross-section.) However, the sneutrino as a dark matter particle 
is excluded by direct searches for dark matter. Indeed, due to weak interactions, 
the cross-section of elastic scattering of sneutrino off nuclei is two to three orders 
of magnitude larger than experimental bounds shown in Fig. 9.2. We note in this 
regard, that the sneutrino is LSP in a much narrower class of models as compared 
to the neutralino. 

A lot less problematic are right sneutrinos which exist in SUSY theories with 
right neutrino masses in the 100 GeV range. The right sneutrinos interact with the 
Standard Model particles very weakly, so their elastic cross-sections are well below 
the bounds shown in Fig. 9.2. 


9.6.3. Gravitino 


Consistent supersymmetric theories that include gravity are theories with local 
supersymmetry — supergravities. Supersymmetry is local in the usual gauge theory 
sense: the action is invariant under supersymmetry transformations depending on 
space-time point. In these theories the graviton has its superpartner, the gravitino 
G u- If supersymmetry were unbroken, the gravitino would be a massless spin-3/2 
particle with 2 polarization states, the same number as for the graviton. 
Supersymmetry, however, must be broken. In theories with spontaneously bro- 
ken global supersymmetry (no supergravity), the Goldstone theorem implies the 
existence of massless particles whose properties are dictated by the properties of 
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broken symmetry generators. In the case of supersymmetry, the generators are 
fermionic (Grassmannian) and carry spin 1/2 (they generate the transformation 
boson + fermion and change spin by 1/2), so the massless particle is a Majorana 
fermion w, goldstino. Like any Nambu-Goldstone field, goldstino interacts with 
other particles through its coupling to the relevant current, which is supercurrent 
in the case at hand, 


1 
L= z3: Joa (9.61) 


This is a direct analog of the Goldberger—Treiman formula which describes interac- 
tions of pions, the (pseudo-)Nambu-Goldstone bosons of broken chiral symmetry. 
The parameter F in (9.61) has dimension of mass squared and is determined by the 
vacuum expectation value that breaks supersymmetry. Hence, VF is of the order of 
the supersymmetry breaking scale (in complete analogy to chiral model where the 
pion decay constant fr is of order of the chiral symmetry breaking scale). 

Once supersymmetry is local, the super-Higgs mechanism is at work. The gold- 
stino becomes the longitudinal component of gravitino, 


~ ~ M 
Cy Gye ivan Out, (9.62) 
and gravitino acquires mass m3/2 proportional to F. The relation is 


8r F 


Phenomenologically, the value of VF may be anywhere in the interval 
1TeV S$ VF S Mpi, 


where the lower bound corresponds to the superpartner mass scale. Hence the wide 
range of possible gravitino masses, 


2-10-*eV S msjo S Mpi. 


The formulas (9.61) and (9.62) where w is the longitudinal component of gravi- 
tino, imply that gravitino interactions with matter have in principle a rather simple 
form. Indeed, the interactions of “original” gravitino components are suppressed 
by the Planck scale. On the other hand, the vast majority of SUSY theories have 
relatively low SUSY breaking scale, VF < Mp), so the massive gravitino interacts 
with matter mostly through its longitudinal component, the would-be goldstino. 
This interaction has the form (9.61). To estimate the interaction strength, let us 
integrate the action by parts and obtain 


1 SUSY 
Lapp Pd, . 


At low energies, the divergence of the supercurrent is due to explicit breaking of 
supersymmetry in the effective action. This breaking manifests itself in the mass 
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differences between particles and superpartners. Hence, the effective gravitino inter- 
actions with other particles are!® 


2 ~ 
TEF, (9.642) 
M,- 
TA pE (aa) 


where f and À are the sfermion and gaugino, respectively, and mg and M, are their 
masses. The interesting and phenomenologically viable range of parameters is 


100 GeV S mg, My S10 TeV. 


Hence, the effective couplings (9.64) are small, and often very small. This is the 
reason for peculiar phenomenology and cosmology of the gravitino. Yet there is one 
property of the gravitino in common with other superpartners. It is clear from (9.61) 
and (9.62) that in theories with conserved R-parity, the gravitino is odd under 
R-parity. Hence, it is created and annihilate in a pair with itself or with another 
superpartner. Furthermore, the gravitino is LSP provided that its mass is roughly 
in the range 


2-10-*eV S ms/2 < 100 GeV, 
This range corresponds to fairly low SUSY breaking scale, see (9.63), namely 
VF < 10" GeV. 


In theories of this sort gravitino is stable and may be a dark matter candidate. We 
note, though, that direct detection of gravitino dark matter is very unlikely in the 
foreseeable future because of very weak gravitino interactions with matter. 


A class of realistic SUSY models with gravitino LSP makes use of the gauge mechanism of 
mediation of SUSY breaking to the SUSY Standard Model (for reviews see, e.g., Refs. [133, 
134]). SUSY is assumed to be broken in the secluded sector, while the superpartners of the 
Standard Model particles obtain masses due to the usual gauge interactions of the Standard 
Model. The interaction with the secluded sector is due to messengers, heavy fields that 
are charged under the Standard Model gauge group and at the same time couple to the 
secluded sector. As a result of the latter interactions, the masses of scalar messengers q 
get split 


„A 
"M 


m =m (13 J A< M, 


where M is the mass of fermionic messengers, and the parameter A? is proportional to 
SUSY breaking vacuum expectation value F. 


16One might question (9.64) as it does not appear to have smooth limit as F — 0 in which 
supersymmetry is restored. This is not the case: superpartner mass parameters m2, are themselves 
proportional to F. The ratio (9.64) is in fact determined by the couplings of those interactions in 


the full theory which provide masses to superpartners after SUSY breaking. 
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Messengers induce one loop corrections to the gaugino masses, 
M)(M) ~ =A, (9.65) 
4a 


where a is the relevant gauge coupling of the Standard Model. The fields from the scalar 
sector of SUSY Standard Model obtain contributions to their masses at two loops, 


mÈ(M) ~ (=y A. (9.66) 


TT 


The proportionality coefficients in (9.65) and (9.66) are in general different for different 
fields. 

Hence, the non-supersymmetric masses of scalar and fermion superpartners at the scale 
M depend on their quantum numbers only, and are of the same order, 


a 


7e A. 


My~ m7 ~ 


Flavor-independence of the gauge interactions ensures the absence of new flavor-changing 
parameters at the scale M. Gravitational interactions between the secluded sector and 
SUSY Standard Model fields lead, generally speaking, to the appearance of these parame- 
ters. In particular, the gravitational effects generate flavor-nondiagonal masses for squarks 
and sleptons, whose estimate is 


Am;_, ~ F/Mp, ~ ™3/2+ 


These masses are phenomenologically acceptable (they do not lead to contradictions with 
data on rare decays of leptons and mesons), provided that 


m 
— ae g 107? — 107, mẹ ~ 500 GeV. 


This gives the upper bound on the gravitino mass in gauge mediation models, 
ms3/2 S (107? — 107°) -mp ~ 1 GeV, 
as well as on the SUSY breaking scale, 
VF < 10° GeV. 


On the other hand, the lower bounds on m3/2 and VF come from the experimental bounds 
on the superpartner masses; roughly speaking, 


My, m; = 300 GeV. 
Making use of this estimate, we find from (9.65) and (9.66) that 
VF > A> 30 TeV, 
ms/2 Z leV. 


While the gravitino is LSP in the gauge mediation models, NLSP is either a neutralino or 
right 7-slepton. The latter case occurs more often. 

We note that gravitino production mechanisms in this class of models include processes 
with messengers and with particles from the secluded sector. These interactions may change 
the estimates given in the text. 
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Fig. 9.12. Processes 2 — 2 with gravitino annihilation or production; Y, X;, i = 1,2,3 are the 
Standard Model particles and their superpartners. In models with conserved R-parity at least one 
of the particles X; must be a superpartner. 


Let us discuss cosmology with stable gravitino. Because of very weak gravitino 
interactions, processes involving two gravitinos (including creation and annihilation 
of two gravitinos) are negligible. Let us begin with the possibility that gravitino was 
in thermal equilibrium at high temperatures. 

Gravitino G is created and destroyed in reactions 


X1+ Go Xo+ X3 (9.67) 


where X;, i = 1,2,3 are other particles in the cosmic plasma. These reactions pro- 
ceed via s-, t- and u-channel exchange of virtual particles Y; see Fig. 9.12. Each 
diagram involves the coupling between the particles X1,X2,Y (e.g., gauge cou- 
pling) and the gravitino coupling (9.64). At high tempeartures, when T > mg, My, 
processes due to gravitino-gaugino interactions (9.64b) dominate, and the cross- 
sections of the processes (9.67) are estimated as follows: 
Mx g? 
0a Ope a= An 

(The cross-sections involving gravitino-sfermion interactions (9.64a) are additionally 
suppressed by a factor m2/T?.) Gravitino freeze-out temperature is determined, as 
usual, from the relation 


where nx is the equilibrium number density of particles in the initial state of the 
reaction (9.67) and vg is the gravitino velocity. Let us assume that freeze-out occurs 
when gravitinos are still relativistic. Then durect and inverse processes (9.67) freeze 
out at the same time. We take nx ~ T°, v@ = land estimate the number of channels 
as g.(Zy). This gives the following estimate: 


1 F 
Vaa Mx’ 


where one of the factors F has been expressed through the gravitino mass according 
to (9.63). 
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Since F > M3, gravitino freeze-out indeed occurs when gravitino is relativis- 
tic.!” Hence, the estimate of the present gravitino abundance is made in the same 
way as the estimate for neutrino, see Sec. 7.2. The expression for gravitino is 

3 93/2 43 i 
=--(——)})-.-—-— : 9.69 
3 /2,0 ri ( 2 ) Thy 05 ( ) 
where we used the expression (5.33) for entropy at T < 1 MeV. Here g3/2 is the 
number of active degrees of freedom of gravitino; since the interaction of the trans- 
verse degrees of freedom is suppressed by negative power of Mp ;, we should set 
93/2 = 2. 


Problem 9.10. Derive the formula (9.69). 


Problem 9.11. Find freeze-out temperature of transverse degrees of freedom of 
gravitino. 


The result (9.69) shows that the present gravitino mass density is 
_ ™3/2° 73/2 ( 13/2 ) (££) 210 1 
Q32 = ——— = 0.2 ——).- —.: 9.70 
a De 200eV/ 2 u(y) ) 2h? (7a 


This means that stable gravitinos of masses m3/2 Z 200eV are cosmologically for- 
bidden, if these gravitinos were in thermal equilibrium at high temperatures. This 
translates into the bound on the SUSY breaking scale, 


VF < 10° TeV. 


Gravitino of mass m3/2 ~ 200 eV would make the whole of dark matter. However, 
this mass is below the lower bound (9.4) dictated by the analysis of structures. 
Thus, if gravitinos were in thermal equilibrium in the early Universe, they cannot 
serve as dark matter candidates. 


Problem 9.12. Show that stable gravitinos of mass m3/2 < 200eV that were in 
thermal equilibrium at high temperatures are safe from the viewpoint of primordial 
nucleosynthesis. 


Gravitino as dark matter is still an option if the freeze-out temperature (9.68) 
was never reached in the Universe. One of the possible scenarios is as follows. Let us 
assume that the thermal history of the Universe began at some temperature Tmax- 
Let us also assume that gravitinos were not produced before the hot stage, so that 
there are no gravitinos at T = Tmax, while the Standard Model particles and their 
superpartners are at thermal equilibrium. The gravitino production then occurs in 
the conventional hot Big Bang regime. 


17 For m3/2 < 10keV, the estimate (9.68) gives the freeze-out temperature below the scale of 
superpartner masses, Ty < Mh. In this case the processes (9.67) terminate, roughly speaking, at 
Tf ~ My, since at least one of the particles X1, X2, X3 must be a superpartner. Freeze-out still 
occurs in the regime of relativistic gravitino, so the argument that follows remains valid. 
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In fact, this situation is quite realistic in inflationary theory. According to this theory, 
energy density before the hot stage was due to the inflaton field, and the hot plasma was 
produced as a result of the decay of this field (see details in the accompanying book). 
Naturally, this picture implies that the maximum temperature is well below the Planck 
mass. This temperature is actually model dependent and varies a lot in concrete inflation- 
ary models. Gravitino interactions with inflaton may be very weak, so gravitinos were not 
produced prior to the hot stage in this picture. 


At the hot stage, gravitinos are predominantly produced in two-body decays of 
superpartners, 


X,-G+X;, i=1,... (9.71) 
and in two-particle scattering processes, 
Xit+Xj > Xe+G, i,j,k =1,... (9.72) 


With these two production channels, the Boltzmann equation for the gravitino 
number density is 
dng /2 
dt 


+ 3Hn3/2 = dT a, yj," NE, +X (ov); -NX,NX;, (9.73) 


where Tẹ, is the width of the decay (9.71), y; is the Lorentz-factor of X;-particles 
that gives rise to the decrease of their decay rate, nx, is the number density of 
particles X; and (av);,; is the production rate in the reaction (9.72) (thermal average 
of the product of cross-section and relative velocity.) We neglected here the inverse 
processes Xk + em Xi + X; and X; + Ga Xi; this is the valid approximation for 
gravitino abundance well below its equilibrium value. In terms of gravitino-entropy 
ratio A3/2 = 3/2/s and temperature, the Boltzmann equation reads (we follow the 
logic as in Sec. 9.6.1) 


dA3/2 Ig, He NX, NX; 
= . —> Z T4 
ToT ae ae ts ov) (9.74) 


We now make use of the formulas (9.64) for the gravitino coupling to estimate the 
width and cross-section, 


My MR 
T ~ SS . 
x, 7 IGF? imi M, ’ (9 75) 
M? 8r M? 
ij = const -a— = const -a— - ——~—. ! 
(ov)i; = const - a F2 = const az m2 MP, (9.76) 


As we already poined out, at T >> Mx the gravitino production in scattering is 
dominated by its interaction with the gauge particles, Eq. (9.64b). Note also that 
neither the width (9.75) nor the cross-section (9.76) depends on temperature. The 
constant in (9.76) is determined by the number of possible final states. 
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The two terms on the right-hand side of Eq. (9.74) behave in a very different 
way as functions of temperature. At high temperatures one has ng, ~ ny x a 
and y;' œ T~!. Hence, the first and the second terms are proportional to T~* and 
T, respectively. We see that the gravitino production in decays is not particularly 
sensitive to Tmax, while the number of gravitinos produced in scattering increases 
linearly with Tmax- 

Let us first discuss gravitino production in decays of superpartners. There are 
two types of processes relevant here. One is the decays of all superpartners at high 
temperatures when they are relativistic or almost relativistic. The other is possible 
delayed decay of next-to-lightest superpartner, NLSP. Let us consider them in turn. 

The contribution of the superpartner X into the first term in (9.74) behaves as 
T-3 at T >> Mg and exp(—Mx/T) at T < Mg (assuming that X is in thermal 
equilibrium; the opposite situation may be relevant for NLSP and will be discussed 
later.) Hence, the gravitino production occurs predominantly at T ~ Mx, and we 
obtain for this contribution 

3 
ae oe (9.77) 


We see that the largest contribution comes from the heaviest superpartners (pro- 
vided that Tmax Z Mg) and that the abundance is larger for lighter gravitino. The 
latter property is due to the fact that lighter gravitinos correspond to lower SUSY 
breaking scale, see (9.63), and hence stronger gravitino interaction with matter. We 
find from (9.77) the contribution to the present gravitino mass density, 

S0 M$ 


03/2 ears 7 


Pe g °mgj2Mpı i 


Plugging in numbers, Mg < 10 TeV, gą ~ 200, we find that the first decay mecha- 
nism is capable of producing gravitino dark matter for m3/2 < 1 MeV. Interestingly, 
the right amount of gravitino warm dark matter, m3/2 ~ 1-10 keV (see Sec. 9.1), 
is obtained for fairly light superpartners, Mg < 300 GeV; see Ref. [135] for details. 

Let us turn to the second decay mechanism, delayed decays of NLSP. Neglecting 
for the moment its interactions with gravitino, we apply the analysis of Sec. 9.6.1. 
The NLSP freeze-out temperature is 


Myusp 
20 ` 
NLPS do not decay by that time provided that 


Tf wisp as 


2 


T ig 
I xzse < A(T} wrse) = a 
Mpi 


Applying the estimate (9.75) to NLSP we see that this condition holds for gravitino 
of mass m3/2 Z 10keV. Let us concentrate on this case. After NLSP decays, the 
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number of gravitinos in comoving volume is the same as freeze-out number of NLSP. 
This immediately gives 

_ M3/2 
P0,3/2 = M PO,NLSP 5 


NLSP 
where po,nzsp is a would-be mass density of stable NLSP. The latter has been esti- 
mated in Sec. 9.6.1, where we have seen that the mass density is often higher than 
the critical density, 


Po,wuse ~ (LO—1000) pc. 
So, the second decay mechanism of gravitino production leads to the right gravitino 
dark matter density for relatively heavy gravitino, 


100 SY) 


~ (0.1—1 . 
13/2 (0 0) GeV ( Masse 


This scenario is somewhat problematic, however, since according to (9.75), the 
NLSP lifetime is large, 


i m3/2\2 /100GeV\* 
Tyrsp = lip ~ 5+ 104s- (e) (a) l 


= 100s, NLSP decays occur at or after BBN epoch. The decay products 
in the end contain hard photons, electrons and other particles which may destroy 
light nuclei in primordial plasma, hence spoiling the predictions of BBN theory, see 
the end of Chapter 8. This problem does not occur if gravitino is sufficiently light 
(m3/2 S$ 100 MeV) and/or NLSP is sufficiently heavy (Myisp Z 300 GeV). 


We note that in the latter scenario gravitinos are born relativistic. Hence, they 
effectively serve as warm dark matter. Their distribution in momenta is far from 


For Turse Z, 


thermal; this, generally speaking, is relevant for structure formation. 


Problem 9.13. Find the distribution over momenta of gravitinos produced in NLSP 
decays after NLSP freeze-out. 


Problem 9.14. Let gravitino of mass 100 MeV be dark matter particle, and the 
mass and lifetime of NLSP be 200 GeV and 10s. Estimate the present spatial size 
of density perturbations suppressed as compared to the CDM case (see Sec. 9.1). 


As an example of BBN bounds, let us consider mSUGRA model of Sec. 9.6.1 and add a light 
gravitino to it. Let gravitino be LSP. Let us treat the gravitino mass as a free parameter 
(which, strictly speaking, is not the case in simple models of gravity mediation). In this 
model, NLSP is either neutralino or superpartner of right tau-lepton. The bounds on the 
parameters of this model [136] are shown in Fig. 9.13 and Fig. 13.9 on color pages. 


Let us now turn to gravitino production in scattering, the second term in (9.74). 
It is important for Tmax Z Ms. At T ~ Tmax matter particles are thermalized, and 
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Fig. 9.13. Bounds [136] on the plane (M1 /2, Mmo) in mSUGRA model with m3/2 = 100 GeV and 
two values of tan 3. See Fig. 13.9 for color version. 


their number densities are all of order of the photon number density. We parame- 
terize the sum in the right-hand side of (9.74) as 


S"(ov) ig nxnx = (hoan, 

ij 
and make use of the estimate (9.76). The largest contribution to (o4o4) comes from 
colored particles; numerically const ~100 in (9.76). Using this estimate for (Ctot), 
we obtain the scattering contribution to gravitino-to-entropy ratio as the second 
term in the right-hand side of Eq. (9.74) at T = Tmax. This gives 


Ny (Tinax) 


m3 /2T max ; gx (To) 
ToaxH (Tmax) 


03/2 = ——— - ny,0 


i (Ctot) : 
Pe Gx (Triax) 


Numerically, 


2 
Q3 72 ~ a) ( Tmax ) ; (5) . =) Toa (9.78) 
m3/2 10 TeV 1 TeV Os Tine) 2h 
Since there is linear dependence on Tmax, the maximum temperature in the Universe 
must be bounded from above. 

The estimate (9.78) can be refined by integrating the Boltzmann equation (9.74) 
numerically. The result [137] for a SUSY model with bino NLSP and the mass 
hierarchy Myrsp = 50GeV < Ms = 1TeV is shown in Fig. 9.14. This type of 
cosmological bounds is very restrictive, since in simple models of reheating the 
maximum temperature exceeds T ~ 108 GeV. We note that in this model, relic 
gravitinos make all of dark matter for parameters just below the line shown in 
Fig. 9.14. It is clear both from this Figure and from Eq. (9.78) that obtaining the 
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Fig. 9.14. Bounds [137] on the parameters (Tmaz, m3/2). The region above the line is excluded 
because of gravitino overproduction. The solid line corresponds to 03/2 h? ~ 1. 


right dark matter density with gravitinos requires tuning parameters of different 
nature: the parameter m3/2 has to do with particle physics while Tmax characterizes 
the early cosmology. 

To conclude this Section, let us consider one possibility that exists in SUSY 
models with unstable but not very heavy gravitino. This case is typical for simple 
mSUGRA models. The gravitino lifetime is large, so these models may lead to the 
cosmological scenario with intermediate domination of non-relativistic gravitino. 
This is the first of the two scenarios considered in Sec. 5.3. For heavy gravitino, the 
main contribution to the decay rate comes from “original” transverse components 
(not goldstino), and the estimate for the width is 


T3729 2% —. 
came 

Let the maximum temperature in the Universe be so high that gravitinos were in 

thermal equilibrium with the rest of plasma (this assumption may be relaxed, see 


Sec. 5.3). Since gravitino freeze-out occurs in relativistic regime, gravitino mass 
density at T Ș m3/2 is given by 


p3/2(T) ~ mgy2T®. 


So, the intermediate matter-dominated epoch is indeed possible for long-lived grav- 
itino. The requirement from BBN, that (see (5.49)) 


T2 
[3/2 > _ ~BBN 
Mýı(Tegn) 
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gives rather strong lower bound on the gravitino mass, 


2 2 71/3 
m3/2 > Fae , Le, mg3/o > 45 TeV. 
Moreover, the self-consistency of the cosmological model requires either quite light 
LSP (if stable), Mzsp ~ 10 MeV, or even much heavier gravitino. 

This bound is fairly restrictive in models with gravity mediated SUSY breaking. 
This bound, however, does not apply to cosmological scenarios in which gravitino 
mass density was never high. 


9.7. Axions and Other Long-lived Particles 


Many extensions of the Standard Model contain light scalar or pseudoscalar par- 
ticles. In some models these new particles are so light and so weakly interacting 
that their lifetime exceeds the present age of the Universe. Hence, they may serve 
as dark matter candidates. Depending on the context, they are called axions (see 
below), dilatons, familons, sgoldstino, etc. 

Let us consider general properties of models with light scalars or pseudoscalars. 
These particles should interact with the usual matter very weakly, so they must be 
neutral with respect to the Standard Model gauge interactions. This implies that 
interactions of scalars S and pseudoscalars P with gauge fields are of the form 


Corr Cprr 
— SF p F”, = 
7 S M Lprr 3 


where Fuy is the field strength of SU (3)e, or SU(2)w, or U(1)y gauge group. The 
parameter A has dimension of mass and can be interpreted as the scale of new 
physics related to S- and/or P-particle. This parameter has to be large, then the 
interactions of S and P with gauge bosons are indeed weak at low energies. Because 
of that, the Lagrangians (9.79) contain gauge-invariant operators of the lowest pos- 
sible dimension; in principle, one could add to (9.79) terms like A~°S(Fi,,)*, but 
their effects would be negligible at energies well below A. Dimensionless constants 
Cgrr, Cprr are determined by the fundamental theory valid at energies above A. 
These parameters run with the energy scale Q? characterizing the processes con- 


Lsrr = PP Belen, (9.79) 


sidered. However, the latter property will not be important for our estimates, and 
we will assume that these parameters are numbers of order 1. In the standard 
basis of the Standard Model gauge fields, the terms (9.79) describe interactions 
of (pseudo)scalars with pairs of photons, gluons, as well as with Zy-, ZZ- and 
WtW~-pairs. 

Interactions with fermions can also be written on symmetry grounds. Since S' and 
P are singlets under SU (3)e x SU(2)w x U (1)y, no combinations like Sf f or Pfy>f 
are gauge invariant, so they cannot appear in the Lagrangian (hereafter f denotes 
the Standard Model fermions). Gauge invariant operators of the lowest dimension 


256 Dark Matter 


have the form H f f, where H is the Higgs field. Hence, the interactions with fermions 
are 


YPHff 
A 


It often happens that the couplings Ysz7¢ and Yspg are of the order of the Standard 
Model Yukawa couplings, so upon electroweak symmetry breaking the low energy 


SHF 


Y, = 7 
Lug = SHES, Leng = - PH fof. 


Lagrangians have the following structure, 


C m > C m = 
Leg = -SFf, Lrg = SE. Phys, (9.80) 


where we assume that dimensionless couplings C's and Cpg are also!’ of order 1. 

Making use of (9.79) and (9.80) we estimate the partial widths of decays of P and 
S into the Standard Model particles (of course, we are talking about kinematically 
allowed decays) 


3 2 
™ pcs) mMyMP(s) 
Tpsysaa © aS.) OP sy pp ~ ——— 9.81 
P(S) AA ~ Gaga? POSIN gop (9.81) 
where A denotes vector bosons, and we omitted threshold factors. By requiring 
that the lifetime of new particles exceeds the present age of the Universe, Tg(p) = 
rip) > Ho 1 we find a bound on the mass of the dark matter candidates, 


mps) < (167A? Ho)". (9.82) 


Assuming that the new physics scale is below the Planck scale, A < Mp), we obtain 
an (almost) model-independent bound, 


mps) < 100 MeV. (9.83) 


Hence, the kinematically allowed decays are P(S) —> yy, P(S) —> vi and P(S) > 
ete. It follows from (9.81) that the two-photon decay mode dominates, unless 
the mass of the new particles is close to that of electron. The decay P(S) > yy 
enables one to search for the dark matter particles P(S): they would produce a 
photon line of energy equal to half of the mass of P(S). The relative line width 
must be of order 1073, which is roughly the velocity disperion in a galaxy. The 
results of this search by cosmic telescopes are shown in Fig. 9.15. It is clear from 
this Figure that the lifetime of a dark matter particle must exceed considerably the 
age of the Universe: the typical value 104 on vertical axis corresponds to the lifetime 
Tp(s) ~ 107° s ~ 3-10! years. 


Problem 9.15. Making use of Fig. 9.15 and assuming that the dominant decay 
channel is P(S) — yy, refine the estimate (9.83). 


18In some cases the sets of constants {Cff}, {Cpp} have hierarchical structure. To simplify the 
discussion, we do not consider these cases in what follows. 


9.7. Axions and Other Long-lived Particles 257 


10° 


=’ = =A = 
Q, ° Q =] 
w > Th 


F (J > yy) [s1] x 107° 


a 
2. 


= 
=] 
© 


107 10 105 10% 103 102 10-7 10° 100 10 103 104 
E, (GeV) 


Fig. 9.15. Observational bounds on partial decay width of dark matter particle to two pho- 
tons [168]. 


Let us now consider generation of relic (pseudo)scalars in the early Universe. 
There are several generation mechanisms; two of them are fairly generic for the class 
of models we discuss. These are generation in decays of condensates and thermal 
generation. (We will consider yet another mechanism later, in the model with axion.) 
Let us briefly discuss the two mechanisms in turn. We will denote the light particle 
by S for definiteness. 

Let some scalar field ¢ be in condensate in the early Universe; we recall here 
that the condensate is the homogeneous scalar field that oscillates at relatively late 
times, when mg > H. In other words, the condensate is a collection of @-particles 
at rest; see Sec. 4.8.1. Let both particles, @ and S, interact with matter so weakly 
that they never get into thermal equilibrium, and let the interaction between ¢ and 
S have the form pS?/2, where u is the coupling constant. Then the width of the 
decay ¢ — SS is estimated as!® 


2 


goss ~ (9.84) 


167mg 


If the widths of other decay channels do not exceed the value (9.84), the decay of 
é-condensate occurs at temperature Ty determined by?° 


2 
TS 


Iygss ~ H(Tọ) = Me 
Pl 


19 As we discuss in the accompanying book, the decay of condensate may be faster due to the Bose 
enhancement. We leave this possibility aside for simplicity. 

20We assume for definiteness that the energy density of ¢-condensate is small compared to that 
of hot matter. 
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Let the energy density of ¢-condensate at that time be equal to pg, so that the 
number density of decaying ¢-particles is ng ~ pg/mg¢; see Sec. 4.8.1. Immediately 
after the epoch of ¢-particles decays, the number density of S-particles is of order 
€p¢/Me¢, where e is the fraction of the condensate that decayed into S-particles. 
After S-particles become non-relativistic, their mass density is of order 


where we omitted the dependence on g, for simplicity. In this way we estimate the 
mass fraction of S-particles today, 


ps msTy _ €Ppo ( Ms ) Epo 
Qs = = ~ —. ~ 0.2. . : 9.85 
© pe Pe mel leV meT3 ( ) 


With appropriate choice of parameters, the right value Qs ~ 0.25 can be obtained 
indeed. We note that the last factor on the right-hand side of (9.85) must be small. 

Let us now consider production of light (pseudo)scalars in collisions of the Stan- 
dard Model particles in primordial plasma. Since the interactions (9.79) and (9.80) 
are very weak, and since they involve three particles, the dominant processes of 
S-particle creation and annihilation at temperatures T « A are?! 


S + Xi << Xo + X3. (9.86) 


These are analogous to the processes (9.72) which we discussed in the gravitino case; 
the relevant diagrams are basically the same as in Fig. 9.12 (with S substituted for 


G). Making use of (9.79) and (9.80) we obtain the estimate for the cross-section in 
the relevant case ms < T < A, 


a 
75 ~ Aa 


where a = g?/(4m), and g is the Standard Model coupling entering the vertex 
Y X2X3 (see Fig. 9.12). We now equate the Hubble time to the mean free time of 
the Standard Model particles with respect to the emission of S-particle, 
1 1 M%, 
g wv SS i SS awa EL 
os(p)ny(T) H(T) T? 


and obtain the freeze-out temperature of S-particles, 


A 
Mp, 


Ten o7tA- (9.87) 


If the temperature in the Universe had ever exceeded Tsp), the S(P)-particles were 
in thermal equilibrium. In that case their properties are the same as the properties of 


21At T > mp there are also processes S +h +> ff and crossing processes. The order-of-magnitude 
estimates below apply to these processes as well. 
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thermally produced gravitino. The estimate (9.70) applies, and one concludes that 
the mass of the dark matter S-particle must be unacceptably small, mg ~ 200eV. 


Problem 9.16. Neglecting nucleosynthesis constraints, estimate the fractional mass 
density Qs in models where S'-particles freeze out being non-relativistic, i.e., may 
pretend to make cold dark matter. 


If the maximum temperature in the Universe was lower than Ts, the number 
density of thermally produced S-particles is smaller than in the above case. The 
mass mg may then be in the acceptable range of 1 keV and larger. The corresponding 
analysis is similar to that for gravitino, and we do not make it here. 

Let us now turn to a concrete class of models with Peccei-Quinn symmetry 
and axions. This symmetry provides a solution to the strong CP-problem, and the 
existence of axions is an inevitable consequence of the construction. 


Let us briefly discuss the strong C P-problem [138-140] and its solution in models with 
axion. The starting point is that one can extend the Standard Model Lagrangian (see 
Appendix B) by adding the following term, 


ALo ae . bo - GIG °, (9.88) 
8T 


where as = g? /(4r) is the SU (3). gauge coupling, G4, is the gluon field strength, Guy a — 
der? GS, is the dual tensor and fo is an arbitrary dimensionless parameter (the factor 
as /(87) is introduced for later convenience). The interaction term (9.88) is invariant under 
gauge symmetries of the Standard Model, but it violates P and CP. The term (9.88) can 
be written as the divergence of the vector (we consider the theory in Minkowski space for 
definiteness) 


Qs 
ALo = T 0 uk", 
where 
K” = clvre ; (cac; Æ tyasakas) : 


and G% is the gluon vector potential. Hence, the term (9.88) does not contribute to the 
classical field equations, and its contribution to the action is reduced to the surface integral. 
For any perturbative gauge field configurations (small perturbations about G% = 0) this 
contribution is equal to zero. However, this is not the case for configurations of instanton 
type. This means that CP is violated in QCD at non-perturbative level. 

Furthermore, quantum effects due to quarks give rise to the anomalous term in the 
Lagrangian?’ which has the same form as (9.88) with proportionality coefficient determined 


?2We have in mind the anomaly in the axial current corresponding to U(1),4 transformations, 
Oud x pea Gh =. 
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by the phase of the quark mass matrix My. The latter enters the Lagrangian as 
Lm = Gt Mgqr + hc. 


By chiral rotation of quark fields one makes quark masses real (i.e., physical), but that 
rotation induces a new term in the Lagrangian, 


ALm = = - Arg(DetM,) - GL,” *. (9.89) 


There is no reason to think that Arg(DetM,) = 0. Neither there is a reason to think 
that the “tree-level” term (9.88) and anomalous contribution (9.89) cancel each other. 
Indeed, the former term is there even in the absence of quarks, while the latter comes 
from the Yukawa sector, as the quark masses are due to their Yukawa interactions with 
the Englert—Brout—Higgs field. 

Thus, the Standard Model Lagrangian should contain the term 


ALo = ALo + ALm = Eo + Arg(DetM,))G2,G"” * = = Ger (oop) 


This term violates CP, and off-hand the parameter 0 is of order 1. 
The term (9.90) has non-trivial phenomenological consequences. One is that it 
generates electric dipole moment (EDM) of neutron?’ dn, which is estimated as [141, 142] 


dn ~ 9-107*8 -e-cm. (9.91) 
Neutron EDM has not been found experimentally, and the searches place strong bound [1] 
dn S3-10°*° . e- cm. (9.92) 
This leads to the bound on the parameter 0, 
|ð] < 0.3 - 107°. 


The problem to explain such small value of @ is precisely the strong CP-problem. 

A solution to this problem apparently does not exist within the Standard Mode 
The solution is offered by models with axion. These models make use of the following 
observation. If at the classical level the quark Lagrangian is invariant under phase rotations 
of quark fields 


1,24 


iP a9 (PQ) 


dun > è E ee ee (9.93) 
where ega) is the axion charge of nth quark flavor, and }>, ego Æ 0, then the 6-term 
would be rotated away by applying this transformation. Indeed, under this transformation 


23Neutron EDM corresponds to the interaction of neutron spin S with electric field E described 
by the Hamiltonian H = d,ES/|S|. 

4Tf at least one of the light quarks (e.g., u-quark) were massless, there would be no effects due to 
the 6-term. At the classical level, there would exist a global U(1)4-symmetry of y° phase rotations 
of u-quark, uz, —> tfur, ur —> e~t up. This classical symmetry could be employed to set @ equal 
to zero. However, experimental data imply that all quarks are actually massive, so this solution of 
the strong C’'P-problem is most likely ruled out. 
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one has Arg(DetM,) > Arg(DetM,) + 3, and hence 0 — 0 + 3, where the axion charges 
are normalized in such a way that 


D aI 


This global symmetry is called Peccei-Quinn symmetry [143] U(1) pa. Peccei-Quinn (PQ) 
symmetry is explicitly broken in the Standard Model by Yukawa couplings of quarks to 
the Englert-Brout—Higgs field (see Sec. B.1; we omit group and generation indices), 


Ly =Y“Q_HDp+Y"Q,it’H’* Up. (9.94) 


Indeed, the first term in (9.94) would be invariant under transformations (9.93) with 
epo = 1/6 supplemented with the phase rotation H — 9/6 , while the second term 
ply be invariant if the phase of the scalar field was rotated in the opposite way, H — 
eP/6 H, 

One can, however, extend the Standard Model in such a way that the PQ symmetry is 
exactly at the classical level. Quark masses are not invariant under the PQ transformations 
(9.93), so PQ symmetry is spontaneously broken. At the classical level, this leads to the 
existence of massless Nambu—Goldstone field a(x), axion. As for any Nambu—Goldstone 
field, its properties are determined by its transformation law under the PQ-symmetry 


a(x) > a(x) + 8- fra, (9.95) 


where (3 is the same parameter as in (9.93) and fpo is a constant of dimension of mass, 
the energy scale of U(1)pg symmetry breaking. The mass terms in the low energy quark 
Lagrangian must be symmetric under the transformations (9.93), (9.95), so the quark and 
axion fields enter the Lagrangian in the combination 


Lm = R, Mane ©” TPR, + hc. (9.96) 


Making use of (9.89) we find that at the quantum level the low energy Lagrangian contains 
the term 
Qs a ~ 


a=—:— Gia". 9.97 
8x freq “ ( ) 


Clearly, PQ symmetry (9.93), (9.95) is explicitly broken by quantum effects of QCD, and 
axion is a pseudo-Nambu—Goldstone boson. 

Hence, -parameter multiplying the operator G2, 
space-time point and proportional to the axion field, 


HY a obtains a shift depending on 


E E (9.98) 
fra 


Strong interactions would conserve CP provided the axion vacuum expectation value is 


such that (0) = 0. The QCD effects indeed do the job. They generate non-vanishing quark 
condensate (qq) ~ Abcp at the QCD energy scale Agcp ~ 200 MeV. This condensate 
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breaks chiral symmetry and in turn generates the axion effective potential?” 
l-2 muma ,_ aA l MuMa 72 2 92 z4 
a ~ — 50 —— 0°) = ~-———— 0 - Mmi fa 0), : 
Vow — 59 Me (Ga) + O08) = 5 ma fz + O08) (9.99) 


where mz = 135 MeV and fr = 93 MeV are pion mass and decay constant, and Mu/Ma © 


0.5. The potential has the minimum at (@) = 0, so the strong C P-problem finds an elegant 
solution. It follows from (9.98) and (9.99) that axion has a mass 


mse (9.100) 
fro 

i.e., it is indeed pseudo-Nambu-Goldstone boson. 
The simplest way to implement PQ mechanism is to add the second Englert—Brout-— 


Higgs doublet to the Standard Model and choose the Yukawa interaction as 


Y’QLHıDr + Y”Qrir  HžUr. (9.101) 
Then the axion charge of every quark equals efo = 1 /6, while the two scalar fields 


transform under U(1)pg-transformation (9.93) as follows, 
Hı > éH, Ha > eH, 


This ensures U(1)pq-invariance of the Lagrangian (9.101), and hence the absence of the 
0-term. 

Unless other new fields are added, spontaneous breaking of PQ symmetry in this sim- 
plest case occurs due to the Englert-Brout—Higgs expectation values. Then the classically 
massless field (would-be Nambu-Goldstone boson) is the relative phase of the two fields 
Hı and H2. To obtain the low energy Lagrangian let us write 


; 0 0 
Hy = t| a |, Has eO] v |, (9.102) 
V2 v2 


where vı and v2 are the Englert-Brout—Higgs expectation values. They both contribute 
to the W- and Z-boson masses, hence 


4/ V? + v2 = v = 246 GeV. 


The kinetic term for the field B(x) comes from the kinetic terms of the Higgs doublets, 
Lkin, H = ô H10" Hı + 0, H4O" Ho. 
Inserting (9.102) into this expression, we find 


fea 
2 


Lokin; = 0,80" B, 


where 


2 2 
a (9.103) 


Sg Ge 7r 


25 Since (9.93) is the symmetry under phase rotations, the axion potential must be periodic in 6 with 
period 27. The simplest periodic generalization of the expression (9.99) is Va = m2 fo -(1—cos 8), 
while the computation within chiral perturbation theory gives a more cumbersome formula for 
Va [160]. This point will not be important for us in what follows. 
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The axion field is related to G(x) by 
a(x) = fra: B(x); 


with this definition the field a(x) has the standard (“canonical”) kinetic term. The axion 
is rather heavy in this model: we find from (9.100) that 


Ma ~ 150 keV. 


The interaction of this axion with quarks, gluons and photons (see below) is rather strong, 
and this particle, called Weinberg—Wilczek axion [144, 145], is experimentally ruled out. 

This problem is solved in models with “invisible axion”, in particular, in Dine- 
Fischler-Srednicki—Zhitnitsky [146, 147] (DFSZ) model and Kim-—Shifman-—Vainshtein— 
Zakharov [148, 149] (KSVZ) model. The idea is to make the scale of PQ symmetry breaking 
independent of the electroweak symmetry breaking scale. In DFSZ model this occurs in 
the following way. One adds into the model with the Lagrangian (9.101) complex scalar 
field S which is singlet under the Standard Model gauge group. One also adds interactions 
involving PQ invariants 


S's, HiH2- S. 
The field S transforms under U(1)pq as 
S — e’ s. (9.104) 


The axion field is now a linear combination of the phases of fields Hı, H2 and S. By 
repeating the calculation leading to (9.103) we find 


Jur +03 +02 (9.105) 


feo = V4“, 


where vs is the vacuum expectation value of the field S. The latter can be large, so it is 
clear from (9.105) that the mass of axion is small and, most importantly, its couplings to 
the Standard Model fields are weak: these couplings are inversely proportional to fpo ~ vs. 
Note that DFSZ axion interacts with both quarks and leptons. 

The KSVZ mechanism makes use of additional quark fields Vz and WV; which are 
triplets under SU(3)-. and singlets under SU(2)w x U(1)y. Only these quarks transform 
non-trivially under U(1)pa@, ere) = 1, while the usual quarks have zero PQ charge. One 
also introduces a complex scalar field S which is a singlet under the Standard Model gauge 
group. One writes PQ-invariant Yukawa interaction of the new fields, 


L = yo SV rr + h.c., 


so that S transforms under U(1)pq as S —> ef S. PQ symmetry is spontaneously broken 
by the vacuum expectation value (S) = vs /v2. The axion here is the phase of the field S, 
therefore 


fpo = vs. (9.106) 


The KSVZ model does not contain explicit interaction of axion with usual quarks and 
leptons. 

The Peccei—Quinn solution to the strong CP problem has its own drawbacks. First, 
one expects that any global symmetry is broken by gravitational interactions. This may 
give rise to terms in the low energy Lagrangian which break U(1) pq explicitly and have 
the form veg /M a. where veg is a (pseudo)scalar combination of the fields with non- 
zero PQ charge and mass dimension N. These terms would generically shift the vacuum 
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expectation value (0) from zero to a value of order 1. In other words, axion solution to the 
strong CP problem is unstable against the addition of operators which break U(1)paq, even 
though they are suppressed by the Planck scale [169]. Second, models with spontaneously 
broken global U(1) symmetry generically have topological defects similar to those studied 
in Chapter 12. If this symmetry is broken down completely, these defects are cosmic strings. 
If the symmetry is broken down to a discrete subgroup (e.g., Z2), which is often the case 
in theories with axions, the situation is even worse, since there appear domain walls. The 
latter are dangerous from the cosmological viewpoint, as we will discuss in Sec. 12.4. 


Thus, the axion is a light particle whose interactions with the Standard Model 
fields are very weak. Its mass is related to PQ symmetry breaking scale fpg by 
(9.100). The property that its interactions are weak relates to the fact that it is 
pseudo-Nambu-—Goldstone boson of a global symmetry spontaneously broken at high 
energy scale fpo > Mw. Like for any Nambu-Goldstone field, the interactions of 
axion are described by the generalized Goldberger—Treiman formula 


1 
La = Tro d Opa : Jpg: (9.107) 
Here 
ee 
Jpg = = fate. (9.108) 


The contributions of fermions to the current Jpg are proportional to their PQ 


charges a these charges are model-dependent. Besides the interaction (9.107), 


there are also interactions of axions with gluons, see (9.97), and photons, 


as a 
La = eG Ga =O 
9 Bm 7 ere = 


Pph; (9.109) 
where the dimensionless constant Cy is also model-dependent and, generally, is of 
order 1 (although may be fairly small numerically). In accordance with (9.96), the 
action (9.107) can be integrated by parts and we obtain instead 


1 = 
La = ———-0- Oy Shg = — a Om, (9.110) 
Q 


plus anomalous interactions (9.109). The interaction terms (9.109) and (9.110) 
indeed have the form (9.79), (9.80) (with P(x) = a()), i.e., models with axions 
belong to the class of models with light, weakly interacting pseudoscalars. The 
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axion mass, however, is not a free parameter: we find from (9.100) that 
107 -9 


(9.111) 


Ma Z 0.6eV - ( 
The main decay channel of the light axion is decay into two photons. The lifetime 
Ta is found from (9.81) by setting A = 27 fpo /a and using (9.111), 


pon ET waas (L). 


T= 
Paaa a? m? Ma 


By requiring that this lifetime exceeds the age of the Universe, Ta > to ~ 14 billion 
years, we find the bound on the mass of axion as dark matter candidate, 


Ma < 256V. (9.112) 


There are astrophysical bounds on the strength of axion interactions f PO and hence 
on the axion mass. Axions in theories with fpo < 10° GeV, which are heavier than 
107? eV would be intensely produced in stars and supernovae explosions. This would 
lead to contradictions with observations. So, we are left with very light axions, 
Ma < 107? eV. 

As far as dark matter is concerned, thermal production of axions is irrelevant. 
Indeed, the estimate of the mass density of thermally produced axions basically 
coincides with that of gravitino (9.70), which gives way too low Qa. 

It may seem that axion cannot serve as dark matter candidate. This is not the 
case. There are at least two mechanisms of axion production in the early Universe 
that can provide not only right axion abundance but also small initial velocities of 
axions. The latter property makes axion a cold dark matter candidate, despite its 
very small mass. One mechanism has to do with decays of global strings [150] — 
topological defects that exist in theories with spontaneously broken global U(1) sym- 
metry (U(1) pg in our case; for a discussion of this mechanism see. e.g., Ref. [151]). 
Another mechanism employs axion condensate [152-154], homogeneous axion field 
that oscillates in time after the QCD epoch. Let us consider the second mechanism 
in some details. 

As we have seen in (9.99), the axion potential is proportional to the quark 
condensate (gq). This condensate breaks chiral symmetry. The chiral symmetry is 
in fact restored at high temperatures.”° Hence, one expects that the axion potential 
is negligibly small at T > Agop. This is indeed the case: the effective potential for 
the field 6 = 0 + a/ fra vanishes at high temperatures, and this field can take any 
value, 


6; € [0 , 27), 


where we recall that the field ĝ is a phase. There is no reason to think that the 
initial value 6; is zero. As the temperature decreases, the axion mass starts to 


26We have here an analogy to phase transitions considered in Chapter 10. 


266 Dark Matter 


get generated, and the field 0, remaining homogeneous, starts to roll down from 
6; towards its value @ = 0 at the minimum of the potential. This homogeneous 
evolution is described by the Lagrangian 


om faq (d0\* _ ma(T) 
— 2 dt 2 


feof, 
where ma(T) is a function of temperature, so that 


ma(T) ~ 0 at T > Agcp; 
Ma(T) ~ Ma at T < Agcp. 


Hereafter Mma denotes the zero-temperature axion mass. 

The evolution of a scalar field in expanding Universe is studied in Sec. 4.8.1. It 
follows from that study that axion field practically does not evolve when ma(T) < 
H(T) and at the time when m,(T) ~ H(T) it starts to oscillate. Let us estimate 
the present energy density of axion field in this picture, without using the concrete 
form of the function m(T) for the time being. 

The oscillations start at the time tose when 


Ma(tose) Om (tose): (9.113) 
At this time, the energy density of the axion field is estimated as 


Paltosc) eS m3 (tose) frQh - 


According to the discussion in the end of Sec. 4.8.1, the oscillating axion field is the 
same thing as a collection of axions at rest. Their number density at the beginning 
of oscillations is estimated as 


Pa (tose) 


Hialtose) is Ma(tose) 
a\"ose 


ae Ma(tose) fol; as H (tose) fpo. 


This number density, as any number density of non-relativistic particles, then 

decreases as a~*. (We will explicitly see this in the end of this Section.) 
Axion-to-entropy ratio at time tosc is 

Na H (tose) fpo fw fra 62 


S 2 IT 3 o n 4/9xToseM pi ii 
The axion-to-entropy ratio remains constant after the beginning of oscillations, so 
the present mass density of axions is 


mi a 
yet. 
V9GxLosceM Pi ? 
In fact, it is a decreasing function of ma. Indeed fpo is inversely proportional to 


Ma, see (9.100); at the same time, axion obtains its mass near the epoch of QCD 
transition, i.e., at T ~ AQcp, 80 Tose depends on Ma rather weakly. 


(9.114) 


Na 
Pa,0 = ‘gee = 
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Fig. 9.16. Experimental constraints on the parameters of models with dark matter axions [170]. 
The inclined line “KSVZ axion” is the prediction of the KSVZ model; the yellow band around it 
shown predictions of the majority of axion models. See Fig. 13.22 for color version. 


To obtain preliminary estimate, let us set Tose ~ Agcp œ 200 MeV and make 
use of (9.100) with C, ~ 1. We find 


—6 
Qa = EL w (2) 82. (9.115) 


The natural assumption about the initial phase is 6? ~ 0.1 — 10. Hence, axion of 
mass 1075—1077 eV is a good dark matter candidate.?” This is cold dark matter: 
we have seen in Sec. 4.8.1 that effective pressure of the oscillating field is zero. This 
is of course consistent with the fact that oscillating field corresponds to axions at 
rest. 

Let us note here that search for relic axions with masses Mma ~ 107° — 1077 eV 
is a difficult but not hopeless experimental problem [155]. Limits on cosmic axions 
are shown in Fig. 9.16. 

To refine the above estimate, let us make use of the explicit formula for the 
axion mass at T > Agcp. It reads [156] 


A 3.7 
ma(T) = 0. ma(0): ( a , T> Agen. (9.116) 


27We note that axion of lower mass Ma < 1077 eV may also serve as dark matter particle, if for 
some reason the initial phase 6; is much smaller than 7/2. We note also that the mechanism of 
axion production in decays of topological defects is capable of producing right axion abundance 
for Ma > 107° eV as well. 
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We then find from (9.113) that the temperature at which axion field oscillations 
start is 


Ma 0.2 Agep 0.7 
Tose ~ 200 MeV - (a) + (seer) è (9.117) 


Note that the assumption Tose > Agcp is justified for ma > 107° eV. Inserting the 
estimate (9.117) into (9.114) we get 


(9.118) 


(dot eey\ 1 
m 0.2- P. (2E N e. 
02-8 ( ) o 


Ma 


We see that our earlier estimate (9.115) is quite reasonable, and the dependence on 
the axion mass is close to inverse proportionality. This is related to strong depen- 
dence on temperature in?® (9.116). Interestingly, similar dependence of Qa on Ma 
is obtained in models employing string mechansim of the axion production. 

Let us discuss in more detail what can be the value of the initial phase 0; 
entering the estimate (9.118). In inflationary models, this phase becomes spatially 
homogeneous either after the inflationary epoch or at that epoch. The former case 
occurs if U(1)po breaks down after inflation. Then the phase 6; is homogeneous 
over spatial regions of size comparable to the Hubble size or smaller, while on larger 
scales the phase is randomly distributed. Naive averaging over large scales gives 

a5 L Pace 
m=z? dô = y 
However, non-linear terms in the axion potential modify this estimate and yield a 
larger value [160] (82) ~ 4.5. Making use of this value in (9.118), one obtains the 
prediction for the mass of dark matter axions: 


Ma ~1-107-° eV. (9.119) 


Under the above assumption of late breaking of U(1)pq, lighter axions are ruled 
out by cosmology while heavier axions can make only part of dark matter. 

The second case occurs if U(1) pq is broken already at inflationary stage. As we 
point out in Sec. 1.7 and further discuss in accompanying book, an initially small, 
causally connected region is inflated at that stage to a region larger than the present 
horizon. Therefore, the initial phase 0; is the same in the visible Universe, but its 
value cannot be predicted. Hence, the axion mass cannot be predicted either. 

In fact, in the latter case the initial phase 0; is not quite homogeneous in space. 
At inflationary stage, vacuum fluctuations of all massless or light scalar fields get 


28 The estimate (9.116) for the temperature dependent axion mass is obtained within dilute instan- 
ton gas approximation. Recent lattice calculations, Ref. [159] and references therein, reveal larger 
numerical factor and much milder dependence on temperature, ma(T) x T~!, although Qa is 
still close to inverse proportionality to the axion mass. However, the window of ma is shifted with 
respect to estimate (9.118) by a factor of 5 towards lighter axions. Likewise, the estimate (9.119) 
becomes Ma ~ 2- 1078 eV. 
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enhanced. As a result, scalar fields become inhomogeneous on scales exceeding the 
inflationary Hubble scale H TA The amplitudes of these inhomogeneities (for canon- 
ically normalized fields) are equal to Hing /(27); this property is studied in the detail 
in the accompanying book. Applying this to axion, we observe that the initial con- 
figuration is 


(x) = A + 56;(x), 


where go is homogeneous in space, while 80; (x) is the Gaussian random field which 
exists at all cosmological scales and has amplitude squared 


Phase perturbations give rise to perturbations of axion dark matter energy density, 
which are uncorrelated with perturbations of conventional matter. These uncorre- 
lated dark matter perturbations are called isocurvature (or entropy) modes. Cos- 
mological observations show that their contribution cannot exceed a few per cent 
of the dominant adiabatic mode; see the accompanying book for details. 

Both of these cases are illustrated in Fig. 9.17, which shows the parameter 
space (Ma, ĝi, Hing). The boundary between the two cases is the line fa = Tau = 
Hing /(27). 
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Fig. 9.17. Cosmological bounds on parameters of inflationary models with axion dark mat- 
ter [171]. Axion can be dark matter particle in white regions. In the case of post-inflationary PQ 
symmetry breaking, this is the narrow strip between the regions Qa < cpm and Qa > Qcpm in 
the right part. In the case of PQ symmetry breaking at inflation, allowed is the white region in the 
left part. Besides the cosmological bounds, shown are astrophysical bound (axion effect on white 
dwarf cooling), bound on the Hubble parameter at inflation from non-observation of tensor modes 
and experimental limits (data from ADMX-I experiment and projected sensitivities of ADMX II 
and CARRACK). 
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To end this Section, let us check explicitly that the homogeneous oscillating field of 
variable mass has the property that 


decays as a~*. Let us still use the notation @ for this field. Let us write the field equation 
in expanding Universe, 


d8 dð T 
qe tH) + maT = 0. (9.120) 


Let us multiply this equation by dð /dt and find 


AN 2 A\ 2 2 
ld (2) +3H. (2) + mal) dp _o (9.121) 


dt 


We solve this equation approximately, making use of the fact that for ma(T) >> H(T), the 
following equality for averages over the oscillation period holds, 


(2 ‘) = ma(t)(6). (9.122) 


So, we obtain the equation 


dl”) 1 dma(t)\ 75 
— (su + malt) dt ) ee 


It gives 


const 


mP) = 


a3 


The left-hand side here coincides with na, in view of the fact that 


do” arig 
Pa = const - T = const - m(t) (0°). 


(The constant here equals fea for the field ð, and equals to 1 for canonically normalized 
field.) 


9.8. Other Candidates 


As other dark matter candidates, we consider sterile neutrinos in Sec. 7.3 and Q-balls 
in Sec. 12.7. Let us briefly discuss one more proposal. 
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9.8.1. *Superheavy relic particles 


Less natural dark matter candidates are particles of very large mass (we will call 
them X-particles): 


Mx > 100 TeV. 


We recall that the assumption that these particles were in thermal equilibrium 
at T = Mx would lead to overproduction of these particles; see (9.41). Hence, 
one has to assume that thermal (more precisely, chemical) equilibrium had been 
never reached. The superheavy non-thermal relic particles are sometimes called 
“wimpzillas”, see, e.g., Ref. [157]. 

Let us briefly discuss several production mechanisms of superheavy dark matter. 
We begin with production in collisions of light particles in hot plasma. This mech- 
anism works if the maximum temperature in the Universe Tmax is smaller, but not 
much smaller than Mx. Generalizing the analysis of Sec. 9.2, we find that the right 
abundance, Qx œ% 0.2 is obtained when there is a very definite relation between 
Tmax and Mx, 


Mx 
Tmax 


= 25 + - log( M? (o)), (9.123) 


where o is the production cross-section of X-particles in the plasma. The sup- 
pression of the relic mass density, as compared to the equilibrium case, is due to 
the Boltzmann factor here. Note that the right mass density of X-particles can be 
obtained at the expense of fine-tuning between the particle physics parameter Mx 
and cosmology-related parameter Tmax- 


Problem 9.17. Derive the relation (9.123). 


Heavy particles can be created before the hot stage, at the very process of for- 
mation of hot plasma. Such a process, reheating, occurs, in particular, in successful 
models with inflationary stage. We discuss reheating in the accompanying book. 
Here we only mention that production of particles at reheating may be efficient 
up to masses My ~ 10! GeV, even though the maximum temperature of nearly 
equilibrium plasma after reheating is several orders of magnitude smaller. 

Of even more exotic possibility is that inflation ends by vacuum phase transition. 
It occurs via spontaneous creation of bubbles of new vacuum, expansion of these 
bubbles and collisions of bubble walls. The collision of two walls may be viewed 
locally as a collision of particles of mass m and Lorentz-factor y. The scale of m 
is the energy scale of the phase transition; the same scale determines the final 
temperature, Tmax < m. On the other hand, the Lorentz-factor y may be large, 
and one expects that particles of mass up to Mx ~ ym can be produced in wall 
collisions. So, this mechanism is also capable of producing a heavy relic of very large 
mass. 
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Let us also note that time-dependent gravitational field produces particles as 
well. This mechanism can work both at inflationary stage and afterwards. The 
most efficient production occurs at the time when My, ~ H. That can happen at 
the radiation-dominated epoch as well. In the latter case, the number density of 
X-particles produced when Mx ~ H is given at later times by [158] 


M 3/2 
px = 5-1074- Mx- (=) . 


The present mass fraction of the heavy relic produced in this way is then 


P M; \”? 
= 109 GeV ` 
We see that this mechanism is successful for Mx ~ 10° GeV and that it overproduces 
heavy particles of larger masses. Detection of superheavy dark matter with Mx > 


10° GeV would probably mean that the Hubble parameter at the hot stage was 
never as large as Mx. 


9.8.2. Exotica 


To end this chapter, we notice that we have not considered many other dark matter 
candidates that have been proposed. These include relic black holes, new strongly 
self-interacting particles, axino (axion superpartner), mirror matter, etc. These 
exotic possibilities often require fine tuning of parameters and/or rather contrived 
cosmological scenarios. In any case, the predictions for the dark matter density are 
strongly model depenedent in the majority of the models. 


Chapter 10 


Phase Transitions in 
the Early Universe 


As we mentioned in Chapter 1, there are no direct experimental indications yet that 
temperatures above a few MeV existed in the Universe. Nevertheless, it is natural 
to assume that the Universe in the past had much higher temperatures.' Properties 
of cosmic medium were very different at different temperatures. In particular, there 
could occur phase transitions associated with the rearrangement of the ground state. 

At temperatures above 200 MeV, quarks and gluons do not form bound states — 
hadrons — and the medium is in the phase of quark—-gluon matter. At these tem- 
peratures, there is no quark condensate, i.e., the phase of unbroken approximate 
chiral symmetry is realized. If temperature in the Universe ever exceeded 200 MeV, 
then at some moment of time there was the transition from quark—gluon plasma to 
hadronic matter comprised of colorless particles: pions, kaons, nucleons and other 
hadrons. Moreover, at the same or almost the same time there must have occurred 
the chiral transition, responsible for the formation of the quark condensate. 

It is quite likely that there was an era of even higher temperatures, T = 
Mw ~100GeV. Oversimplifying the situation, we can say that at these tem- 
peratures electroweak symmetry was unbroken, and the Englert-Brout—Higgs 
expectation value was zero. When the temperature dropped, the electroweak 
transition [30-33] may have occurred, which resulted in the non-zero Englert- 
Brout—Higgs expectation value and spontaneous breaking of electroweak symmetry 
SU(2)w x U(1)y down to the electromagnetic U(1)em. 

Depending on the maximum temperature at the hot stage of the cosmologi- 
cal expansion, and on physics at ultra-short distances and at ultra-high energies, 
phase transitions could occur at even higher temperatures. Namely, if the Uni- 
verse had temperature of about 1016 GeV (which is very strong and hardly realistic 
assumption), and physics at these energies is described by Grand Unified Theory, 


lFor instance, in Chapter 9 we noted that a simple and efficient (and therefore plausible) mecha- 
nism of non-baryonic dark matter generation works at temperatures of tens of GeV or above. Many 
mechanisms generating the baryon asymmetry of the Universe (though not all) require even higher 
temperatures, from 100 GeV up to 10!5 GeV (see Chapter 11), depending on specific mechanism. 
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then there was the Grand Unified phase transition at temperature comparable to 
Meur ~ 1016 GeV. It is not excluded that phase transitions occurred also at inter- 
mediate temperatures, Mew <T &« Maur. 

It is important that properties of cosmic medium can in principle change in either 
a smooth or abrupt way. In the latter case one speaks about phase transition proper, 
while the former is known as smooth crossover. The notions of different phases 
and phase transions is particularly well defined when there exists an exact global 
symmetry,” and the phases differ by their properties with respect to this symmetry. 
In most of such cases there is an order parameter which is nonzero in one phase and 
which vanishes in the other. A condensed matter example is the phase transition 
to ferromagnetic state in metals: the symmetry in this case is invariance under 
rotations in space, and the order parameter is spontaneous magnetization. If there 
is no symmetry and hence no order parameter, then the system may exhibit either 
phase transition or smooth crossover, depending on internal or external parameters. 
An example is vapor—liquid transition, which is the first order phase transition at 
low pressure and is not a phase transition at all at high pressure. Slightly above the 
critical pressure, the properties of the medium change rapidly as the temperature 
decreases, so there is a smooth but fast crossover. 

Both possibilities — smooth crossover and phase transition — are of impor- 
tance for cosmology. Particularly interesting is first order phase transition, which, 
as we will see in this Chapter, is a strongly out-of-thermal-equilibrium process. We 
emphasize, however, that lattice data show that the transition from quark—gluon 
plasma to hadronic matter in QCD is actually a smooth crossover. Likewise, elec- 
troweak transition we mentioned above is a crossover too [173]. The point is that 
neither QCD no Standard Model have gauge invariant order parameter [180, 181] 
that distinguishes different “phases”. The absence of the phase transition in QCD is 
due to the fact that quarks have nonzero masses, and chiral symmetry is not exact. 
For the electroweak transition, it is important that the Higgs boson is quite heavy, 
mp = 125 GeV. 

The study of phase transitions in the Universe is not only of academic interest; 
it also sheds light on some of the mysteries of cosmology. Among them are the 
problems of the baryon asymmetry and dark matter. Also, phase transitions are 
responsible for possible formation of topological defects in the early Universe. first 
order phase transitions in some models lead to the generation of gravitational waves, 
which may be detected by gravity wave detectors. 

In this Chapter we recall the general classification of phase transitions and intro- 
duce methods for describing phase transitions in the early Universe. We discuss 
predominantly theories with a scalar field and are interested in transitions lead- 
ing to the generation of nonzero expectation value of this field. In other words, we 


?Gauge invariance (“local symmetry”) is not actually a symmetry; rather, it is a redundancy in 
the description which enables one to formulate the theory of spin-1 particles in a Lorentz-invariant 
way. 
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consider models that have the same structure as the Standard Model. As usual in 
field theory, the applicability of analytical methods is limited to theories with small 
couplings, but we will see that this is not enough: detailed analytical description of 
phase transitions is possible only when the vacuum value of the mass of the Higgs 
boson is sufficiently small. 

In theories where couplings are not small, analytical study of phase transitions 
from “first principles” is usually impossible; the most reliable source of information 
here are numerical methods based on lattice field theory. An important example is 
the confinement—deconfinement transition and the transition with chiral symmetry 
breaking in QCD. They occur at temperatures T ~ 200 MeV, when the QCD gauge 
coupling a;(T) is large. We will not present any detailed study of QCD transitions 
in this book, although there is no doubt that they actually occurred in the early 
Universe (assuming that temperatures T = 200MeV were indeed realized). The 
reason is that these transitions apparently left no traces in the present Universe 


(with the exception of rather exotic proposals such as the formation of quark nuggets 
with large number of strange quarks [172]). We also mention in this context axions 
as candidates for dark matter: one of the mechanisms of their generation is based 
on the very fact that the chiral phase transition occurred in the early Universe, 
while the results are practically insensitive to the dynamics of this transition; see 
Sec. 9.7. 
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Phase transitions occur because of mismatch between the properties of the ground 
states of the theory at zero and nonzero temperatures. As we show below, in theories 
with Englert-Brout—Higgs scalar field(s) this is caused by non-trivial temperature- 
dependent terms in the effective potential. At finite temperature, the equilibrium 
state of the medium corresponds to the minimum of the Grand thermodynamic 
potential (Landau potential). As we discussed in Chapter 5, chemical potentials are 
negligibly small in the early Universe at interesting temperatures T = 1 GeV, and 
the Grand potential reduces to the (Helmholtz) free energy F. Hence, we consider 
below the free energy of primordial plasma. To find the expectation value of the 
scalar field (¢), at temperature T, one considers a system in which the average 
value of the field is fixed and equal to ¢ everywhere in space, but otherwise there 
is thermal equilibrium. The free energy of such a system depends, of course, on the 
chosen value ¢, as well as on temperature. Because of spatial homogeneity, the free 
energy is proportional to the spatial volume Q, 


F = Vep (T, ¢). (10.1) 


The free energy density of medium at temperature T with the average scalar field 
uniform and equal to ¢ is called the effective potential Vep (T, @). In thermal equi- 
librium, the free energy is at minimum with respect to all macroscopic parameters, 
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Fig. 10.1. The effective potential at (a) zero and (b) high temperatures. 


including the average scalar field. Therefore, (ġ)}r is the absolute minimum of the 
effective potential Vg (T, 6) at given temperature (we will often omit the argument 
T in Vep). 

At zero temperature, the free energy reduces to the energy of the system, and the 
effective potential coincides with the scalar potential V(¢) entering the field theory 
action,’ see Fig. 10.1(a); in theories we are considering, the field ¢ has nonzero 
expectation value v. 

At finite temperature, Veg (T, ġ) does not coincide with V(¢). As a result, it 
often happens that the expectation value of the field ¢ vanishes; in this sense the 
symmetry is restored; see Fig. 10.1(b). 

Let us emphasize one point here, which applies to gauge theories with the 
Englert—Brout—Higgs mechanism; let us talk about the Standard Model for defi- 
niteness. The Englert-Brout—Higgs field ¢ is not gauge invariant in the Standard 
Model, so in strict sense, its expectation value is not a legitimate object. Gauge 
invariant operators like ¢'¢ (more precisely, H'H, see Appendix B) are invariant 
under all symmetries of the Lagrangian, and they cannot serve as order parameters. 
Thus, “phases” with “unbroken” and “broken” symmetry are in fact indistinguish- 
able. A related observation is that the notion of the effective potential V (T, ¢) makes 
sense, with reservations, only in perturbation theory. We discuss perturbation the- 
ory at finite temperature in Sec. 10.2, and consider its applicability in Sec. 10.3. 
It suffices to say here that perturbation theory is not applicable at temperatures and 
field values which are relevant for the electroweak transition. It is precisely for this 
reason that the transition is actually a smooth crossover, as we already mentioned. 
An adequate method of its study is lattice field theory. 


3In fact, even at zero temperature the effective potential does not coincide with the scalar potential 
entering the classical action. This is due to quantum corrections. In weakly coupled theories, 
quantum corrections to the effective potential are often small. 
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Fig. 10.2. The expectation value (¢ġ)r as a function of temperature for systems with (a) 1st order 
and (b) 2nd order phase transition. 


We consider in this Chapter the opposite situation, when the perturbation theory 
is applicable. In that case, there is a phase transition from (ġ)r = 0 to (¢)r # 0 
which occurs at critical temperature Te. 

Two types of phase transitions are most common; these are phase transitions of 
the first and second order. From the standpoint of the general formalism, first order 
phase transition is accompanied by jumps in quantities which directly characterize 
the system, like density. In field theory this corresponds to a jump in the expectation 
value (¢) as a function of temperature, see Fig. 10.2(a). On the contrary, 2nd order 
phase transition is characterized by continuous behavior of the expectation value 
($)r, see Fig. 10.2(b). This difference is illustrated in Fig. 10.3 where the families of 
effective potentials Veg (ġ, T) as functions of ¢ at various temperatures T are shown. 
The left panel of Fig. 10.3 shows the first order phase transition, culminating in an 
abrupt change of (¢),. The right part of Fig. 10.3 corresponds to the second order 
phase transition: the expectation value (¢), is a smooth function of temperature. 

The famous example of the first order phase transition is boiling of liquid. 
Examples of the second order phase transition are transitions in ferromagnets, 
order-disorder transitions in alloys of metals, transitions into superconducting and 
superfluid states. 

The way the transition proceeds is quite different for the first and second order 
phase transitions. We are interested in the case where the rate at which temperature 
changes in time is low compared to the typical rate of particle interactions in the 
medium; this is the case for the early Universe. For the second order phase transition, 
the medium properties (for example, the expectation value (¢),) change slowly and 
homogeneously over entire space. At every moment of time the medium is in a 
state close to thermal equilibrium. The same applies to smooth crossover. The 
situation is different in the case of the first order phase transition. Before the phase 
transition, the expectation value (¢), equals zero, but as soon as the minimum 
of the effective potential at ¢ = (¢), 4 0 becomes deeper than the minimum at 
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Fig. 10.3. Shapes of the effective potential V.g(¢) at various temperatures: upper darker curves 
correspond to higher temperatures. Left and right panels describe systems with 1st and 2nd order 
phase transition, respectively. Black circles show the expectation value (¢)r. 


Fig. 10.4. Shape of the effective potential of the system undergoing the first order phase transition. 


$ = 0, the ground state with (4), 4 0 becomes thermodynamically favorable, see 
Fig. 10.4. The transition from the state ¢ = 0 to the state @ = (¢), cannot occur 
homogeneously over entire space: the field value ¢ in such a process would evolve 
homogeneously from ¢ = 0 to ¢ = (ġ)r, and free energy (10.1) in infinite volume 
would be infinitely large in the intermediate states as compared to its initial value 
at @ = 0. The transition proceeds via spontaneous nucleation of bubbles of the 
new phase, their subsequent expansion and mergers, see Fig. 10.5. Nucleation of a 
bubble with ¢ = (¢), Æ 0 in the medium with ¢ = 0 is local in space and may 
occur due to thermal fluctuations. Bubbles expand, their walls collide, the new 
phase percolates, and after this “boiling” the system eventually returns to spatially 


4Sometimes the dominant process is quantum tunneling. 
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Fig. 10.5. Spontaneous nucleation and subsequent growth of bubbles of the new phase at the first 
order phase transition. 


homogeneous state of thermal equilibrium, but with ¢ = (¢), 4 0; the released free 
energy converts into heat. 

This boiling is a highly inequilibrium process. We have already noted that the 
most important stages of the evolution of the hot Universe are those when the cosmic 
plasma is out of thermal equilibrium. Therefore, the first order phase transitions 
are of particular interest for cosmology. 

Let us estimate the nucleation probability of a bubble of the new phase at 
temperature T. Let V- = Veg (T,¢ = 0) and V} = Vig (T, ġ = (¢) 7) be free energy 
densities of the old and new phase, respectively, V} < V_, see Fig. 10.4. The free 
energy of a bubble of size R, relative to the free energy of medium with ¢ = 0 
without a bubble, contains the volume and surface terms. The former is due to the 
fact that the free energy density inside the bubble is smaller than the free energy 
density of the surrounding medium; it is negative and equal to 


4 
ark (V =V). 


The surface term exists because the field ¢ near the surface is inhomogeneous and 
differs from both zero and (@) 7; the contributors here are effective potential Veg (¢) 
and the gradient term in the free energy, the latter being now a functional F'[¢(x)]. 
The surface term is proportional to the bubble area and equal to 47R? - u, where 
u is the free energy per unit area (surface tension). Thus, the free energy of the 
bubble, relative to the free energy of the old phase, is (see Fig. 10.6) 


4 
F(R) = 4r R?p — TR? - AV, (10.2) 
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Fig. 10.6. Free energy of a bubble of the new phase as a function of its radius. 


where 
AV =V_- V, >0 


is the difference between the free energy densities of the old and new phases (latent 
heat of the phase transition). From (10.2) we see that at sufficiently small sizes 
the free energy of the bubble decreases with decrease of R; this means that spon- 
taneously nucleated small bubble collapses due to surface tension force, and the 
system returns to the initial homogeneous state with ¢@ = 0. On the contrary, for 
sufficiently large R the free energy decreases with increase of R, i.e., the bubble 
expands and the system converts to the new phase. The minimum size at which the 
bubble begins to expand is determined by the equation 


OF 
—=0. 
OR 
Hence, it is equal to 
2u 
R. = —. 10.3 
AV ( ) 


The bubble of this size is called the critical bubble; its free energy is positive and 
equal to 
4T 16r p’ 
F, = 4r Rp — |R} . AV = ——_.,.. 10.4 
c TiteH 3° Ye 3 (AV)? ( ) 
Importantly, both the size of the critical bubble and its free energy are larger for 
smaller AV. 

Spontaneous nucleation of bubbles of the new phase in hot medium occurs via 
thermal fluctuations, i.e., thermal jumps on top of the barrier shown in Fig. 10.6. 
The probability of such a jump per unit time per unit spatial volume is mainly 
determined by the Boltzmann factor e~'</7: 


T = ATF (10.5) 
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(the Arrhenius formula), where the factor T4 is introduced on dimensional grounds, 
and pre-exponential factor A does not depend very strongly on temperature and 
other parameters. The formula (10.5) is valid for Fe > T, i.e., when the probabil- 
ity of the bubble nucleation is small. This formula together with Eq. (10.4) imply 
immediately that for finite cooling rate the medium remains in supercooled state 
with @ = 0 for some time, even though the new phase is already thermodynami- 
cally favorable. This occurs when AV = V_ — V} is still so small that the bubble 
nucleation rate is smaller than the cooling rate. In the cosmological context, the 
nucleation of bubbles is efficient when the probability of the bubble nucleation per 
Hubble volume per Hubble time is roughly of order 1, i.e., 


pt 4 T? j 
AT*e T ~ H*(T) = (=) . (10.6) 
In specific models, this equation determines the extent to which the cosmic plasma is 
supercooled before the phase transition, and the latent heat AV released as a result 
of the phase transition. These properties are model-dependent. However, one can 
make a general statement concerning the picture of the first order phase transition 
in the Universe: the phase transition begins when handful of bubbles have nucleated 
in the entire Hubble volume. Their size at the time of nucleation is determined by 
microscopic physics, and it is much smaller than the Hubble scale H~!(T), while 
the distance between their centers is comparable to the Hubble size. Bubbles have 
time to expand by many orders of magnitude before they begin to percolate, and 
only a small number of new bubbles are produced during that time. We consider 
this picture at quantitative level in the end of this Section. 

For instance, at T ~ 100 GeV (the electroweak scale) the Hubble size is of order 

H` = Mpi ~ 1cm 

The bubble size at the time of nucleation is, roughly speaking, of order TT}, i.e., 
Re ~ 107t6 cm (in fact, it is one to two orders of magnitude larger). Thus, the 
phase transition in the Universe occurs through the formation of several bubbles 
of subnuclear size in a cubic centimeter of cosmic plasma, their expansion up to 


macroscopic size and merger as a result of collisions of their walls. 


Note that theoretically there is a possibility that the phase transition does not complete 
at all in the expanding Universe, despite the fact that the effective potential has the form 
shown in Fig. 10.4. The zero-temperature scalar potential may have a local minimum at 
o = 0, the false vacuum. It has positive energy density V_, so that even in the absence of 
particles, the Universe filled with false vacuum expands at the Hubble rate 


8T 
H- =4] z OV 
3 CV- 


If the rate of bubble nucleation per Hubble time per Hubble volume is small, rT < H, 
the bubble walls do not collide, because the centers of neighboring bubbles move from 
each other with velocities exceeding the speed of light. (In other words, nucleated bubbles 
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are outside the event horizons of each other; for definition of event horizon see Sec. 3.2.3.) 
Regions of false vacuum grow faster than regions of the new phase, and the phase transition 
does not complete. 

We also note another theoretical possibility. Namely, because of the gravitational inter- 
actions, the false vacuum decay may not occur since bubbles of the true vacuum do not 
nucleate at all [174]. This takes place when the energy density of the true vacuum is neg- 
ative, and the gravitational effects are strong. The study of this effect is beyond the scope 
of this book. 


Let us discuss in general terms how to calculate the surface tension of the bubble 
wall. Let us neglect the curvature of the wall (i.e., consider the bubble of large size R) 
and take the difference between the free energies of the old and the new phase AV 
small. In this case, the configuration of the field w (r) inside the wall is the minimum 
of the free energy F'[¢(1)] considered as a functional of the inhomogeneous field ¢(r). 
At the one side of the wall, r < R (i.e., inside the bubble), the field tends to ¢ = ¢4, 
while at the other side, r >> R, it approaches ¢ = 0. If R is sufficiently large, then 
the coordinate (r — R) inside the bubble can be formally extended to —oo, and we 
write the boundary conditions as 


ow(t) > (¢)r as (r—R) > —00, (10.7) 
dw(2) 20 as (r—R) — +00. (10.8) 


We assume that the temperature corrections to the gradient term in the energy 
functional are small. (This assumption is indeed valid in weakly coupled theories.) 
Then the free energy (relative to the free energy of the old phase) can be written 
as a functional of the field ¢(r), 


F[d] = [ Anr*dr E (2) + Ve (¢) — V|. (10.9) 


The wall thickness is small compared to R for large bubble, and a slowly varying 
factor 4rr? can be treated as a constant inside the wall. Thus, 


F[¢] = 4r R? [- dř ; (2) + Veg (o) — V- (10.10) 


? 
—co 


where 
T=r—R, 


and we formally extended the integration over this variable to —oo (cf. (10.7)). 
The field configuration obeys the Euler-Lagrange equation for the extremum of the 
functional (10.10), 

ao Vegl) 


a (10.11) 
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Fig. 10.7. (a) The potential of the analogous classical mechanics problem; (b) The field configu- 
ration for the bubble of the new phase. 


This equation is formally identical to the equation of one-dimensional classical 
mechanics of a particle in the potential 


U() = —Veg(¢); 


where 7 plays the role of time. We can now neglect AV and set U((¢),) = U(0). 
Then the potential U(@) has two equally high maxima; see Fig. 10.7(a). The solution 
¢dw(r) to Eq. (10.11) describes the roll down of a particle from right hump, in 
accordance with (10.7), and subsequent roll up to left hump. The whole process 
occurs in infinite time; see (10.8). Using the analogy with the classical particle, it 
is straightforward to find the solution in quadratures to Eq. (10.11) with boundary 
conditions (10.7), (10.8), 


dw do 
do V2 (Veg =V) E 
where the limit of integration is chosen in such a way that at r = R the field ¢(r) 
takes an intermediate value ¢; between ¢ = 0 and ¢ = (¢),. The configuration 
@w(r) is shown in Fig. 10.7(b). Note that in the one-dimensional scalar field theory 


with degenerate minima of the scalar potential, this solution is called “kink”. In view 
of (10.12), the free energy (10.10) of the wall reads 


-(R-=r), (10.12) 


Fy = 4nR?x, 


where 


(o)r 
pe f V2 [Wag (#) — V_lde. (10.13) 


Note that the surface tension w is finite in the limit AV — 0. 

The expression (10.2) for the free energy of the bubble, as well as the analysis 
of the field behavior near the wall are valid when the wall thickness is small com- 
pared to the bubble size R, i.e., thin-wall approximation is applicable. According 
to (10.3), it really works if the difference of free energies AV is a small parameter. 
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Otherwise, the configuration of the critical bubble should be obtained by finding 
the extremum (saddle point configuration) of the free energy functional (10.9) with 
the only boundary condition ¢(r — oo) = 0. Details can be found in [175]. 


Problem 10.1. Check the formulas (10.12), (10.13). 


Problem 10.2. Let the effective potential be 


Vag (0) = 39? (6 0)? — 66”, 


where À, v and € are positive parameters. Find the conditions at which the thin-wall 
approximation is valid. Find the surface tension and wall thickness in the thin-wall 
approximation, as well as the size of the critical bubble Re; estimate the probability of 
nucleation of a bubble of the new phase inside the phase with ¢ = 0 in the thin-wall 
approximation at temperature T. 


Let us make a brief comment on the false vacuum decay at zero temperature. We have in 
mind scalar field models in which the scalar potential (at zero temperature) has a local 
minimum (e.g., at ọ = 0), i.e., it has the form shown in Fig. 10.8. The state in which the 
field expectation value is spatially homogeneous and equal to zero is metastable; it is the 
false vacuum. False vacuum decay also occurs via spontaneous formation of bubbles of 
the new phase, but in contrast to the medium at finite temperature, the bubble emerges not 
due to thermal fluctuations, but as a result of tunneling process [176-178]. The description 
of tunneling in the semiclassical approximation is given, for example, in [175]. In weakly 
coupled theories, the probability of the bubble nucleation is exponentially small, 


_ const 


Tae a, 


where a is the small coupling constant. Finally, in a certain range of temperatures the 
bubble formation is dominated by a combination of thermal fluctuation and tunneling. 


Let us discuss the percolation process in some detail. Let us assume that the bub- 
ble nucleation occurs in the thin wall approximation regime; this is indeed the case 
in the weak coupling limit. We will soon see that in weakly coupled theories, the 
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Fig. 10.8. Shape of scalar potential with two non-degenerate minima. 
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phase transition occurs when the temperature is only slightly below the tempera- 
ture T, at which the depths of the two minima are equal. (The latter temperature is 
denoted by T.1 in Sec. 10.2.) Then the difference between the free energy densities 
of the old and new phases is 


OAV 


av = [25% 


a) ` (Te a T), 
and the nucleation probability per unit time per unit volume is parametrized as 
follows [185] 


T= ATte CTT, (10.14) 


where 


167 u? 


> gra 
Our analysis is valid if « is not large, k < 1. This is the case for one-loop effective 
potential in weakly coupled theories; see Sec. 10.2. 

We will see that the duration of the phase transition is shorter than the Hub- 
ble time. Therefore, we neglect the expansion of space, and treat space-time as 
Minkowskian. On the other hand, it is important that temperature decreases in 
time. 

Let a bubble be nucleated in the time interval (t’,t’ + dt’). By the time t its 
size is R(t, t’) = u(t — t’), where u is the wall velocity (soon after nucleation, the 
bubble expands in stationary regime u = const). Since the bubble expands in hot 
and dense matter, this velocity is somewhat lower than the speed of light; typically, 
u ~ 0.1. Let z(t) be the fraction of volume occupied by the old phase at time t. We 
neglect the wall collisions and obtain the following equation: 


t 
4 
x(t) =1- f WTE ER, ta), (10.15) 
te 
where time te corresponds to temperature Te. Percolation occurs at time t = tp 
when the right-hand side of Eq. (10.15) becomes of order 1 (but somewhat smaller 
than 1). To estimate this integral, we set x(t’) = 1 in the integrand and write the 
pecolation condition: 


tp 4T 3 3 
dtr (t) - Bt (tp —t)? ~ 1. (10.16) 
te 
We now recall that t = (2H)~' = Mj, /(2T?). The integral in (10.16) is saturated, 
as we will see soon, at T close to Tp, and both of these temperatures are close to Te. 
Therefore 

Mý T-T, 

T? Te 


tp- t= 
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We rewrite the integral (10.16) as an integral over variable 
nT 
= 

and obtain, by making use of (10.14), 


eNi pr ; 
ZA (2) | dt (Tp — T)e 7 ~i. 
0 


Te 
We see that percolation occurs in the regime e P K < 1, i.e., when the nucleation 
rate is still small. This precisely means that (Te — Tp) < Te (recall that we assume 


that « is not large). The integral is saturated at foe T) K Tp; we write T = 7,(1—6), 
where € < 1, and obtain 


aA (Mi at To -4 a d£ Ee 278 
4 My : \" 
T 3 Pl eas p 
e A p >à Ss ~N i 1 Ll 
5 u Ge | Te 6 (Z) (10.17) 


We see that, indeed, the growth of bubbles and percolation occur in a narrow 


temperature interval, which is determined by the relation 
Ar 7? 
A= > n~ Z. 
K 


Tp 


The formula (10.17) gives for the percolation temperature 


de~ dyp o JR 
Tp ~ T — L , 
where the logarithmically large parameter L obeys 
Me 
Bet = war? (TEL) (10.18) 


We solve this equation in the logarithmic approximation. We obtain finally (leading 


logarithm and first correction) 
ME,” 
u’ AK? (=) | - tg}. 
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The relevant temperature interval is determined by the relation AT/Tp ~ L7!. 


L = log 


5It is convenient to set L = 6x, rewrite Eq. (10.18) as 


* 4 
u’ AK? (2) 
Te 


and make use of the formulas (6.18)—(6.20) and the result of Problem 6.2 in Chapter 6. 


1/6 


6x2e" = 
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Finally, let us estimate the bubble size at percolation. The volume of each bubble 
at percolation is estimated as n~!(t,) where n(t) is the number density of bubbles. 
(All bubbles occupy, roughly speaking, half of space.) Proceeding as before, we write 


n(tp) ~ [ dt T(t). 


c 


This integral is calculated in the same way as the integral in (10.16), and we obtain 


We make use of (10.18) to express exp (-4) = exp(— L), and get 


TS L”? 
n(tp) ~ M53 u3 k3/2? (10.19) 


so that the typical bubble size at percolation is 


Mpi uk!/2 uk!/? 
N 


—1 
T2 13/2 SU UTE 


R(tp) ~ n (tp) 


This is smaller than the Hubble size by a few orders of magnitude, since the velocity 
u is smaller than 1, and the logarithmic factor is large (as an example, Ll? ~ 108 
at Te ~ 100 GeV). 
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In accordance with (10.1), the effective potential is the free energy density of the 
plasma in a state where the average Higgs field takes one and the same value ¢ 
everywhere in space. The free energy F of the system is related to its energy E and 
entropy S by the thermodynamical relation F = E—T'S, so that for the free energy 
density we have 


f=p—-Ts, 


where, as usual, p and s are energy density and entropy density, respectively. We 
know from Sec. 5.2 that entropy density is expressed in terms of energy density and 


pressure, 
gah? 
T 2 
hence the free energy density is® 
f =-p. 


6The fact that the medium chooses the phase with the lowest free energy has a simple physical 
interpretation: in this phase pressure is at maximum, and sufficiently large region of this phase, 
spontaneously created inside the phase of lower pressure, will expand, “pushing away” the phase 
of lower p. 


288 Phase Transitions in the Early Universe 


So, to calculate the effective potential one needs to find the pressure of the system 
under the constraint that the average Higgs field is equal to ¢ in the whole space. 

In theories with scalar fields, particle masses depend, generally speaking, on the 
expectation values of these scalar fields. An example is the Standard Model where 
all particles (except for the Higgs boson) acquire masses due to the Englert-Brout- 
Higgs mechanism. If the field value equals ¢, then quarks and charged leptons, 
denoted collectively by f, and W=- and Z-bosons have masses 


my($) = yro, Mw($) = Yao. Mz(¢) = “4, (10.20) 


where yf are Yukawa couplings, and g and g’ are gauge couplings (see Appendix B 
for notations and details). The vacuum value is ¢ = v/V2, and we return to the 
zero-temperature formulas for the particle masses (see Appendix B), namely, 


foa 72 
E” Mw = Zu, M; = yore y, (10.21) 
In what follows we consider models of precisely this type, and for the time being we 
just assume that particle masses depend in some way on the average value of a field 
ġ, which we assume to be the only relevant scalar field at hand. Our nomenclature 
is that ¢ Æ 0 and ¢ = 0 correspond to broken and unbroken symmetry; what is this 
symmetry depends on a concrete model. 

Let us study theories with small couplings. In this Section we neglect the interac- 
tions between particles in the cosmic plasma, i.e., we consider the free energy of an 
ideal gas of elementary particles. The free energy of the gas depends non-trivially on 
the average value @, since the latter determines particle masses, and, consequently, 


Mp = 


particle contributions to pressure. For reasons given in Appendix D, the ideal gas 
approximation is called one-loop approximation in this context. 

In this approximation, the pressure is a sum of the contributions of the homo- 
geneous field ¢ itself and of each type of particles and antiparticles, 


f= Veg (T, o) =V) +) fi (10.22) 
where V(¢) is scalar potential entering the scalar field action” 


y= J [0,,¢0"6 — V(¢)] dtz. (10.23) 


The first term in (10.22) arises from the fact that the energy-momentum tensor for 
time-independent and spatially homogeneous scalar field is 


Tw (%) = Iw : V ($), 


“In the Standard Model of particle physics, the Englert-Brout-Higgs field is a complex doublet 
with kinetic term in Lagrangian given in (B.8). This explains the choice of the coefficient in front 
of the kinetic term in (10.23). The relation between the field ¢ considered here and the Standard 


Model Higgs boson is $(x) = wee. 
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i.e., the homogeneous scalar field gives a contribution to the pressure equal to p(¢) = 
Ti = To. = T33 = —V(¢). Of course, this is a reformulation of the fact that the 
free energy in vacuo coincides with the energy and its density equals V(d) for 
homogeneous scalar field. In what follows, we assume that the scalar potential is 
given by the standard expression 


vio=a(et-£), 


where v is the vacuum expectation value at zero temperature, is the Higgs self- 
coupling, and \ < 1. 

The second term in (10.22) is the medium contribution in the ideal gas 
approximation, 


fi = —pilT, mi(9)], 


where p;(T, m;(¢)] is the contribution to pressure coming from particles and antipar- 
ticles of ith type, whose mass equals m; and depends on ¢. According to Sec. 5.1, 
we have 

gi [®_  k*dk 1 


mas =e (10.24) 
672 /k2 2 \/k2 +m? 
o +m; eT `r] 


f=- 


where g; is the number of spin states, upper (lower) sign corresponds to bosons 
(fermions). Contributions of heavy particles with m; >> T are exponentially small, 
therefore the interesting case is m; ST. 

Integral (10.24) cannot be evaluated analytically. We analyze it for the particular 
case of high temperature, T >> m, and make use of the expansion in m/T (sub- 
script į will be omitted wherever possible). This approach is called high-temperature 
expansion. In dimensionless variables 


x=k/T and z=m,/T, 
the expression (10.24) takes the form 


oo 4 
-= f B E (10.25) 
0 Ve tev a] 
We are interested in the behavior of these integrals at small z. 

To the zeroth order in z, the contributions f; correspond to pressures of free 
gases of massless particles (see Sec. 5.1); they do not depend on ¢, and will be 
omitted. The integrand in Eq. (10.25) is a function of z?, so one might expect that 
I(z) is a series in z?. The first term in this series is 


dI A ode © edr 
I = 2 ——— SS 
(z) z (a). 2 J exer] + f (e41)? = “| (10 26) 


Stile, IZ): 


f= -or x 
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The integrals here are finite, so the first non-trivial term in the high-temperature 
expansion is indeed quadratic in z. Performing the integration with the use of for- 
mulas given in the end of Sec. 5.1 (in the process, it is convenient to integrate the 
second term in (10.26) by parts), we obtain in this order 


E amoi E gmo. a027) 


bosons fermions 


2 
Varl) =À (# 2 =) E 


This expression has a particularly simple form in models where particles acquire 
masses due to the scalar field condensate, 


mi() = hig, (10.28) 


where h; are coupling constants. This is precisely what happens in the Standard 
Model. The only exception is the Higgs boson itself. We will soon see that its 
contribution into the effective potential in our approximation also has the form 
(10.27), (10.28) with 


grh}, = 6A. (10.29) 


Let us first pretend that the effective potential contains the terms (10.27) only, 
and the masses are given by (10.28). Then the effective potential is 


Veg ($) = (Mv? + ZT’) g? + Agt (10.30) 


(ġ-independent terms are omitted), where 


a= E atg D ah (10.31) 


bosons fermions 


is a positive quantity. The formula (10.30) shows that when the average scalar field 
equals @, the effective Higgs boson mass squared isë (in the unitary gauge) 


meF 2g) = 6A? — Av? + ve (10.32) 


Together with Eq. (10.27) this leads to the relation (10.29). 

At low temperatures, the expression (10.30) has a minimum at ¢ 4 0 (symmetry 
is broken), while at high temperatures the only minimum is ¢ = 0, corresponding 
to the restored symmetry. Minimum at ¢ = 0 disappears and turns into maximum 


8We do not discuss here an important and difficult problem of gauge-dependence of the effective 
potential. We note only, that even though the shape of the effective potential, including the posi- 
tions of its extrema, is gauge-dependent, the values of the effective potential at extrema do not 
depend on gauge choice [317]. Similar property holds for the effective action, which generalizes the 
effective potential to coordinate-dependent background field $(x); the latter property ensures, in 
particular, gauge-independence of the rate of false vacuum decay. The issue of gauge-dependence 
of the effective potential as applied to concrete calculations is considered, e.g., in Ref. [186]. 
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when the first term in (10.30) flips sign, which happens at temperature (the notation 
will be clarified later) 


y\ 1/2 
fei (=) . (10.33) 
a 


In what follows, when discussing weakly coupled theories in general terms, we 
assume that the relevant couplings h; are small, h « 1, and the following 
order-of-magnitude relation holds, 


Aw h?. (10.34) 


The latter ensures that the Higgs boson mass mp ~ Vv is of the same order 
as masses of all other particles contributing noticeably to the effective potential. 
Formally, the relation (10.34) means that the ratio A/h? is finite in the limit h — 0. 
This does not prevent us to have A/h? < 1, which is necessary for the applicability 
of perturbation theory for studying phase transition; see Sec. 10.3. 

With this prescription, the estimate for the critical temperature reads 


Tez ~ Uv. 


This follows from (10.31) in the case of not too large number of particle species. 

In the Standard Model, the main contributions to œ come from the heaviest 
particles, W=- and Z-bosons, t-quark and Higgs boson. Equations (10.28), (10.20) 
and (10.21) yield, in terms of zero-temperature masses, 


1 
= (12My + 6Mz + 12m? + 3m;) . (10.35) 


Here we used the fact that W+- and W~-bosons together have six polarizations, 
Z-boson has three polarizations, and t-quark with its antiparticle have four; also, 
t-quark has three color states. Recalling that the Higgs boson mass is 


Mp = V2Av, 


we arrive at the following one-loop expression for the critical temperature T,2 in 
the Standard Model, 


4m? 1/2 
Lo = Ee + OMB + Im? + z) -v = 146 GeV. (10.36) 
(Be reminded that Mw = 80.4GeV, Mz = 91.2GeV, m % 173GeV, Map = 
125 GeV, v = 246 GeV; see Appendix B). As we already mentioned, perturbation 
theory is not adequate for describing the electroweak transition, so the result (10.36) 
is to be considered as an approximate estimate of the relevant temperature. Numeri- 
cal calculations give the estimate Te œ 160 GeV for the transition temperature [199]. 

If the high-temperature expansion of the integrals (10.24) were really a series in 
z? = m? ($) /T?, one would conclude that we are dealing with the second order phase 
transition: corrections of the fourth and higher orders in ¢ are small compared to 
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terms written in (10.30) (see below), and the position of the minimum of the expres- 
sion (10.30) smoothly moves from ¢ = 0 towards large ¢ as temperature decreases 
from Tez to zero. In other words, the behavior of the expression (10.30) corresponds 
to the right plot in Fig. 10.3. However, integrals (10.24) are not analytic in z?, 
and the one-loop effective potential actually corresponds to the first order phase 
transition. The lack of analyticity can be seen from the behavior of contributions 
to the integrals (10.25) coming from the low-momentum region k < T, i.e., x &« 1 
(infrared region). For small z and x, the expansion of the exponential terms in the 
integrand gives 


A 4 d: 
{= | eaa bosons 
o “+z 
N (10.37) 
1 xtd 
pe) = = J —=—, fermions, 
+ 2 Jo vz? + 22 
where A < 1 is a fictitious parameter separating the infrared region. Formal expan- 
sion of the integrands in these formulas in z? would result in the order z* contribu- 
tions of the form 


Ag 
x 
zt | bosons 
0 


z2’ 


á ^ dx . 
zZ —, fermions. 
0 x 


The first of them linearly diverges at the lower limit of integration, while the second 
one diverges logarithmically. We can therefore expect that besides the above terms 
of order z?, bosons and fermions give contributions of order z*, and z* log z, respec- 
tively. Contributions of the latter type give the terms in the effective potential of 
the form 


P 
c= 


m3(@) log hi ¢* log 2, (10.38) 
These are of little interest from the viewpoint of the phase transition, because at 
A © h? (which holds for not too small Higgs boson mass), they are small compared 
to Ad* coming from the scalar potential V(¢). (Only contributions due to t-quark 
are important in determining certain parameters of the phase transition.) On the 
contrary, terms of order z3 are very important: it is due to these terms that the 
transition (in the one-loop approximation) is of the first order. 

To calculate the term of order z? in the boson integral J_, we divide this integral 


into two parts using the fictitious parameter A, 


o pl 
g dz 1 (IR) 
L- = -= oO HI 
[ g? + 2z? eve te — 1 
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The first term is analytic in z?, while for the second term the approximation (10.37) 
is sufficient, i.e., 


A A 
i =| (£? — 2°) da +24 f = . 
0 0 H + 22 


The first term here is again analytic in z?, while the second term gives the contri- 
bution of order z2? we are after, 


4 
e Esso(2). 


As a result, the effective potential in the one-loop approximation reads 


Ves ($) =A G - =) + a ( XO gim?($) + 5 anit) 


bosons fermions 


a hae X gim3(¢) + O (mio log | l 


In models where particle masses are related to the expectation value of the scalar 
field by (10.28), this expression is rewritten as 


Ver ($) = 55 (P-TA) P -TP + 4%, (10.39) 


where the parameter y is positive and equal to? 


yee DY ai E a (EYF. (10.40) 


bosons bosons 


Here we use the notations introduced in (10.31) and (10.33). 


Problem 10.3. Calculate the terms of order ¢*log $ in the high-temperature 
expansion of the effective potential, using the relation (10.28). Show that at \ ~ h? 
and hi <1 these terms are small compared to those written in (10.39) in the entire 
interesting range of ¢, namely, 0 < Sv. 


The behavior of effective potential (10.39) corresponds to the left plot in 
Fig. 10.3, i.e., to the first order phase transition. Extrema of the effective potential 


°F ffective mass squared of the Higgs boson mef ? = a: (T? — T2 )/24 is small near the phase 
transition, so the Higgs boson itself gives small contribution to the cubic part of the effective 
potential, i.e., to the parameter y. Another qualification is that the time components of gauge 
fields acquire the Debye mass mp ~ gT > g®c, see (10.46), so their contributions to y are also 
absent. In other words, one should set g; = 2 in Eq. (10.40) for each of the vector bosons (rather 
than gi = 3 as in Eq. (10.31)). 
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are determined by the equation 


a ~ - (T? — TS) $ — 37T? + 49? = 0. (10.41) 


At temperature Teo such that 


4ar 
99° To = (T ~ Te) , 

the effective potential acquires two extrema at ọ Æ 0: minimum and maximum. 
This temperature exceeds T,2 only slightly: for A ~ h? we have y ~ h?, aà ~ hf, 
and thus 

T2 =T 2772 

ao n T np, (10.42) 
Tio 4ar 
Due to the small difference of T and Ty in the interesting temperature range, one 
can replace T by Tez in the second term in (10.41). It is important that the second 
minimum of the effective potential (if d = 0 is treated as the first minimum) appears 
at nonzero ¢ = ®.(T,0), 


eg = (Tp) = LT. (10.43) 


As temperature decreases, the second minimum becomes deeper, and the values of 
the effective potential at this minimum and at the minimum ¢ = 0 become equal at 
temperature T,1, such that both Eq. (10.41) and equation’? V.g = 0 are satisfied. 
The solution to the latter system of equations gives the first critical temperature Ter, 


TA -Th _ 6? 


T a’ 
and the position of the second minimum at this temperature, 
da = ®,(Ta) = aa (10.44) 


In view of (10.40), this value is much smaller than the critical temperature, 
®e(Ta) ~ hTea in the weak coupling limit h < 1. Once temperature decreases 
down to the second critical temperature Te2, the minimum of the effective potential 
at @ = 0 disappears and turns into maximum. At that instant the second minimum 
is at 

37 
4A 
Thus, the evolution of the one-loop effective potential shown in the left plot in 
Fig. 10.3 takes place in weakly coupled theories in a narrow temperature interval 
near the critical temperature (10.33), Te2 < T < Teo, where Teo is defined by (10.42). 


Ba = e(Ta2) = Teo. (10.45) 


l0Recall that we dropped ¢-independent terms in the effective potential, i.e., we set 
Veg (¢ = 0) = 0. 
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Immediately after the phase transition, the expectation value of the Higgs field is 
much smaller than its vacuum value, 


De~ {To ~ hToo ~ hv Xv. (10.46) 


Note that the high-temperature expansion is justified in weakly coupled theories, 
since 


m;(®) = h,® ~ KT < Too: 
Another point concerns the latent heat of the phase transition. At T = Te2, the value 
of the effective potential at the minimum (10.45), relative to its value at ¢ = 0, is 
27 7f 4 
256 43" 


This value (with opposite sign) gives the maximum latent heat of the transition, 
which is roughly equal to 


Veg (Tez, ®.(Te2)) => 


Viger hT K Th: (10.47) 


Thus, the energy released during the phase transition is small compared to the 
energy of particles in the plasma, whose density is of order T4. The cosmic plasma 
is heated by the phase transition only slightly. 

Let us estimate the surface tension of a bubble of a new phase. The effective 
potential at T = Tı has the form 

2 
Veg = 6? (9 - Z Ta) - 

The expression (10.13) has to be modified, since the kinetic term is non-canonical 
in (10.23), so we get 


The difference between free energy densities of old and new phases obeys 


dAV ƏV (Be) Pe AV(®.)  ƏV(Be) an ( m) 3 
T= = in |e, 


ar °° 96, ôT ðr ðr BAX OA 


This gives the following expression for the parameter « in (10.14): 
_ 84 7° 
9 N7/2 2 (1 7 ee) 
In the case we consider, we have a ~ h?, A ~ h?, y ~ h®, hence K ~ hi, i.e., 


the parameter « is small in the weak coupling limit. We have used this fact in 
Section 10.1. 
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Problem 10.4. Estimate critical temperatures, field values at minima of the effec- 
tive potential, latent heat and bubble surface tension in weakly coupled theory whose 
gauge and Yukawa couplings are of order h, while the scalar self-coupling is of the 
third order, A ~ hè. 


High-temperature expansion of the integrals (10.24) obviously does work at small 
values of ¢. Therefore, the result that within the one-loop approximation the phase 
transition is of the first order, is justified from this point of view. Of course, the use 
of the high-temperature expansion is not necessary for one-loop calculations. Inte- 
grals (10.24) are easily computed numerically, and thus the exact one-loop effective 
potential is straightforwardly obtained. The corresponding graphs for the theory 
coinciding with the Standard Model but with wrong Higgs boson masses are shown 
in Figs. 10.9 and 10.10. It is seen that at @ < T, results for the effective potential 
obtained by using the high-temperature expansion are in reasonable agreement with 
the exact one-loop calculation. At the same time, the results for a number of char- 
acteristics of the phase transition agree only qualitatively, within a factor of 3 or so 
(see Fig. 10.11). For analytical estimates of these characteristics, the omitted higher 
order terms in m/T are important. Indeed, the ratios ®e(Te1)/Te1 and ®.(T 2) /Te2 
are inversely proportional to the coefficient in front of ¢* in the effective potential 
(see formulas (10.44), (10.45)), and this coefficient gains a significant contribution 
when the omitted higher-order terms in (10.38) are taken into account. 


Problem 10.5. Making use of the high-temperature expansion, calculate the values 
of ®.(Te1)/Ter and ®.(T.2)/Te2 taking into account the contributions of the form 
(10.38). Check that the results are in better agreement with exact numerical results. 


V(¢,T) / V(¢,T) 
mh / 0 
/ o 
T = 70 GeV / 
/ 0. 
7 
7 o 
Zz 
20 40 60 80 100 120 140 20 40 60 80 100 120 140 
$, GeV b, GeV 
V(o,T) 
mh i 
T=50Gev / 
/ 
/ 


Fig. 10.9. One-loop effective potentials at different temperatures, obtained numerically (solid 
lines) and analytically within the high-temperature expansion (dashed lines) for the Standard 
Model with the Higgs boson mass mp, = 50 GeV. Note the difference in scales of the axes for 
different temperatures. 
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Fig. 10.10. The same as in Fig. 10.9, but for the Higgs boson mass mp, = 150 GeV. 
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Fig. 10.11. Comparison of numerical results (black lines) with the analytical results obtained 
using the high-temperature expansion (gray lines) for the one-loop values of (a) ®¢(Te1)/Te1 and 
(b) #e(Te2)/Te2 at various Higgs boson masses mp. 


Much more serious issue is the applicability of the one-loop approximation itself, 
and, more generally, the applicability of the perturbation theory at finite temper- 
atures. These issues are discussed in Sec. 10.3. An important conclusion is that in 
the Standard Model, the one-loop approximation does not describe the transition 
correctly, given the experimental value mp = 125 GeV. 


Let us describe a simple extension of the Standard Model where the Englert—Brout—Higgs 
state emerges as a result of the first order phase transition [187]. The idea is that the 
elctroweak transition may occur from a state with non-vanishing condensate of a new 
scalar field, rather than from a state with vanishing condensates. The phase with the 
condensate of the new field may be separated from the electroweak phase by a potential 
barrier, hence the first order phase transition. 
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We add a new scalar field s to the Standard Model, and take s to be a singlet under 
the Standard Model gauge group. We require for simplicity that the action is invariant 
under discrete symmetry s — —s, then the only additional renormalizable terms in the 
Lagrangian are 


Ls = 5 (Ons)? — (5? — 2)? — xHt Hs’, 


where H is the Englert-Brout—Higgs doublet, As and x are dimensionless couplings and vs 
is a new parameter of dimension of mass. Hence, the scalar potential at zero temperature 
is 


2\ 2 
V(H,s) = (n'n z =) + Zeqa? — v2)? + «Ht Hs?, (10.48) 


where the first term is the scalar potential of the Standard Model. The potential V(H, s) 
has extrema at 


H= a s=0 (10.49) 


and 
H =0, sS = Us. (10.50) 
Let us impose the following conditions on the parameters: 


dv? > xo, (10.51a) 
Me ANa (10.51b) 


Then there are no other nonzero extrema of the potential (10.48), the extremum (10.50) is 
unstable (saddle point of the potential), and the standard electroweak extremum (10.49) 
is the only minimum. 


Problem 10.6. Prove above statements. Show that the conditions (10.51) ensure the sta- 
bility of the vacuum (10.49). 


Let us now consider the one-loop potential at finite temperatures. It is sufficient for 
our purposes to include only terms which are quadratic in temperature, and consider the 
effective potential along the axes (H # 0, s = 0) and (H = 0, s #0). The terms quadratic 
in temperature are in general given by the second term in (10.27). At (H #0,s = 0), we 
have to include contributions due to W~-, Z-bosons, t-quark and also the Higgs boson 
and s-boson. It is convenient to introduce canonically normalized field 


x = V2¢ = V2|HI. 


Then the mass squared of the Higgs boson at x # 0,5 = 0 is 


m? = 7 =3Ax +, (10.52) 


m2 = xx’, (10.53) 
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where we again omit x-independent terms. Then the quadratic in T part of the effective 
potential along the axis (x 4 0, s = 0) is 


1 
AVeg = zal x’, (10.54) 
where 
1 
ay = 7 (997 + 3g’? + 12y? + 12\ + 4%) . 


The four first terms here agree with (10.30), (10.35), and the last term comes from (10.53). 
It is worth noting that the terms omitted in (10.52), (10.53) do not yield contributions 
into AV.g which depend on x. Thus, the effective potential along the axis (x 4 0,s = 0) 
reads 


1 
Veg (x; 8 = 0) = 5ax(T? — Ty)x? + 7X", (10.55) 


where 


is the critical temperature for this axis, and we normalized the potential in such a way 
that Veg (0,0) = 0. 

We now calculate the effective potential along the axis (x = 0,s #0). At s Æ 0 there 
are four bosons making the complex doublet H, whose masses are m? = xs”, while for 


s-boson we have m2 = 3A,s” +.... Other particles do not acquire masses. Therefore, 
= a! 2 2\.2 Às 4 
Veg (x — 0, s) — ze (T = )s T Ei ’ 
where 


1 
eS s 4 5 
a 79 (34 + 4s) 


1/2 
T,= (=) f , 
Qs 
Let us choose the parameters in such a way that 


Toei (10.56) 


Due to large t-quark mass, i.e., large y+, this condition is not particularly restrictive. As an 
example, one can choose Às ~ A, x & A, vs & v (and also satisfy the inequalities (10.51)). 
Then 


T? _ 99° +3g? +12y? +16 5 
T2” 28A D 
This shows that the inequalities (10.51) and (10.56) can be satisfied without fine tuning 
of parameters. 

Once the condition (10.56) holds, then the cosmic medium first (at T ~ Ts) evolves 
into the state with x = 0,s 4 0. As the temperature drops down to T ~ Ty, the second 
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minimum of the effective potential emerges at x # 0, s = 0. The field values at co-existing 
minima and their depths are 


1/2 2 
— | Gs p22 i — Gs 2 _ m2y2 
s= [E-D] o V= os) = -T T) 
and 
— |x (7? _ T? a _) — Oy 2 22 
=(SaR-7)] , Vrs =0) = -XR - T). 


The minimum y # 0, s = 0 has the same depth as the minimum y = 0, s # 0 at 


7 T2ax/ VA — T2as/Vrs we VX VAs 
ay/VR—as]V¥e eN 
At temperature below Te, the minimum x 4 0,s = 0 is deeper, and the first order phase 


transition occurs. Importantly, x(Te) is not small compared to Te in a large region of the 
parameter space, so our perturbative treatment is legitimate; see Sec. 10.3 in this regard. 


T2 


Problem 10.7. Let us set x = As for simplicity. Find the range of parameters A; and 
us/v where the relations (10.51) (10.56) hold, and, furthermore, x(T-) > Te. We note that 
models where the latter inequality is satisfied are of particular interest from the viewpoint 
of electroweak baryogenesis; see Sec. 11.5. 


Problem 10.8. Find the range of parameters where the relations (10.51) (10.56) and 
inequality x(Te) > Te hold, and the scalar s is a sizeable component of dark matter; see 
Problem 9.4 in Chapter 9. Can scalars s make all of dark matter? 


10.3. Infrared Problem 


In this Section we show that the finite-temperature perturbation theory is not always 
applicable, even if couplings are small [156, 182]. Let us discuss this issue at the 
qualitative level first. The physical reason for failure of the perturbation theory is 
that the distribution function of bosons fs = [exp(w/T)—1]7! is large at low particle 
energies w. Owing to that, the interaction between the bosons at low energies is 
enhanced in the medium. Indeed, for small momenta and masses of particles, p< T, 
m & T, the bosonic distribution function has the form 


In quantum field theory, this means that number densities of low-momentum 
particles, 


((al,ap’)) = fs(p)6(p — p’), 


are large in comparison with the commutator 


(([ap,a,])) = 6(p — p’), 
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provided that the particles are light, m(T) < T. Therefore, the infrared part of a 
light boson field ® is a classical field. The linearized expression for it is 


3 
(x) = al d'p (ca, +e Pta] ), 
(27)3/2 wp P 


where ap, al, can be treated as c-numbers at low momenta. Hence, omitting numer- 
ical factors, we obtain for the field fluctuation 


dp 
(a) = | Pho. 
We are interested in the contribution from the infrared (IR) region, 


p’dp T aa dp p’? 
Bp Üp p we 


((®2(c))) ie = / 


Thus, the amplitude of the field fluctuations with momenta of order p is estimated as 


2 p? 

((®*(x)))p = oe 

The field is in the linear regime, if the quadratic (free) contribution to the free 

energy is larger than the contribution due to interactions. At p? > m?, the free 

contribution can be estimated as (V®)? = p?®?. Following the convention adopted 

in this Chapter, we choose the self-interaction of the field ® as h?®*, where h is 

a small coupling. Comparison of the interaction and free contributions to the free 
energy gives 


n2o4 h2T 26 gy ol nT 
ae (rZ) a (10.57) 


(V®)? wi Pi 
This implies for the massless field 


h? t kT 
2° 


(V®) p 


The field is in the linear regime if this ratio is small, which is true only at p >> h?T. 
In the infrared region, p < h?7, on the contrary, the regime is strongly non-linear, 
and the perturbation theory is inapplicable. For massive field, the ratio (10.57) is 
small for all momenta, only if mẹ >> h?T. Otherwise, the infrared region is at strong 
coupling. Note that this is the strong coupling regime in the classical field theory 
at finite temperature. 

In gauge theories with Englert-Brout—Higgs nechanism, this argument applies 
to spatial components of non-Abelian gauge field, whose interaction contains, 
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in particular, the commutator term of the type g?A*. Thus, instead of h? we have 
g’, and the condition of applicability of the perturbation theory has the form 


Mw (TL) > 9°T, 


where Mw (T) is the mass of the gauge field. For the component Ag the argument 
is inapplicable because of the Debye mass mp ~ gT > g?T. 

Thus, the perturbative calculation of the effective potential is justified when 
My (T) > g°T, i.e., 


o> gT. (10.58) 


We see that the effective potential cannot be perturbatively calculated near ¢ = 0. 
Moreover, the description of the first order phase transition presented in the previous 
Section is valid only when the position of the second minimum of the effective 
potential given by Eqs. (10.43), (10.44) or (10.45), satisfies ®. >> gTe. In terms of 
couplings, this restriction has the form 

Les. 

g 

Given that y ~ g? (see (10.40)), and omitting numerical factors (they actually 
work towards stronger restriction), we get from this A < g?, i.e., in terms of the 
zero-temperature mass 


m? < Mg. (10.59) 


It is difficult to refine this estimate. In other words, it is impossible to find 
analytically up to what values of mp the phase transition is of the first order. Still, 
the restriction (10.59) suggests that the one-loop results of Sec. 10.2 most likely 
have nothing to do with reality in the Standard Model, taking into account the 
actual value mp = 125 GeV. 

Since the difficulty is due to interactions of low-momentum particles, it is called 
the infrared problem. The main role here is played by the self-interactions of gauge 
bosons inherent in any non-Abelian gauge theory. 


Let us see at the formal level that the applicability of the perturbation theory for calculat- 
ing the effective potential is indeed limited to the values of the scalar field obeying (10.58). 
As shown in Appendix D, the effective potential is given by the functional integral 


7 PVP) = [PAu m, (10.60) 


where we neglected all but gauge fields. Here 8 = T+, the functional ge [A] is the 
Euclidean action in the Euclidean time interval 0 < T < £, 


B 2 
SPIA] =f dr | x Fa re, + LO ge 4h] (10.61) 
0 


uv pv 2 H 


summation over indices is performed with the Euclidean metric. The integration in (10.60) 
is over fields A, (x, T) periodic in 7 with period 8. 
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Because of the periodicity, the field Ab, (x,7) can be represented as a discrete sum 


1 1 ; 
Au (x, T) = —a (x) + a xe", (10.62) 
where 
27n 
Wn = — = 2anT 
B 


are the Matsubara frequencies, and we omitted the group index and extracted explicitly 
the terms in (10.62) with zero Matsubara frequency. Upon substituting expansions (10.62) 
into (10.61), we arrive at the action of 3-dimensional Euclidean theory with an infinite 
set of fields a,(x), al” (x). We are interested in the infrared region, more precisely, in the 
region of spatial momenta 


|p| < gT. 


At these momenta, only light 3-dimensional fields are relevant, whose masses are much 
smaller than gT. Note first, that ao, ai”) are not k fields: they acquire the Debye mass 
mp ~ gT (see Sec. D.5). Therefore, the fields ao, ai”) can be ignored. Now, the fields al” 
with n Æ 0 are also heavy: the term Fo; Fô; in the original Lagrangian leads to the term in 
the action 


D f Paaa (=n), 
2 


rA E r 


i.e., the mass term in the 3-dimensional theory with large masses |wn|. As a result, the 
only light fields are a;(x), i.e., homogeneous in Euclidean time components of spatial 
vector potentials A;. Substituting Ai(x) = G~'/?a;(x) into the action (10.61), we obtain 
the effective 3-dimensional action describing the infrared properties of the theory at finite 
temperature, 


si [ae (Fries + Tm? (o)ata!) (10.63) 
where 


fig = O:a} — ja? + gVT f” aa] (10.64) 


and f° are structure constants of the non-Abelian gauge group. (In the case of gauge 
group SU(2) these are f’°¢ = 7.) The factor T!/? = 871? emerges in (10.64) due to the 
normalization in (10.62), chosen in such a way that quadratic part of the 3-dimensional 
action (10.63) has canonical form. 

As a digression, we make an observation regarding fermions. Within the Euclidean 
approach all 3-dimensional fermions are heavy: they are antiperiodic in Ø, and their 
Matsubara frequencies w, = 2nrTn’', n = 4 E4, 4 Ee ica are all of order T. Therefore, 
fermionic fields are insignificant for the infrared properties of the theory. This also follows 
from the consideration given in the beginning of this Section: the distribution function of 
fermions fr = [exp(w/T) + 1]~' does not diverge as w — 0, in accordance with the Pauli 


principle. 
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Fig. 10.12. An example of a diagram contributing to the effective potential. 


Coming back to the action (10.63), we note that it is the action of 3-dimensional vector 
fields with mass M(¢) and dimensionful coupling 


g® = gvT. (10.65) 


The ratio of dimensionful quantities [g]? /M (¢) is the effective coupling constant. Indeed, 
in the framework of perturbation theory, the effective potential (more precisely, — 8V (¢)) 
is given by the sum of one-particle irreducible diagrams without external lines (see 
Appendix D) of the type shown in Fig. 10.12. For our purpose, these are diagrams in 
3-dimensional theory with the action (10.63). Diagrams with n loops give contributions 
to GBVeg proportional to [g Ope». On dimensional grounds, these contributions are of 
order 


oy BOPE a a [BOPY” 
PV ~ or a T MC) (a) 


Thus, the expansion parameter in the perturbation theory is [g®]?/M(¢), i.e., the 
perturbation theory is applicable at M/(¢) > g°T. This confirms the result of the qualita- 
tive analysis given in the beginning of this Section. 

To conclude this Section, we make a few remarks. First, we note that at momenta and 
energies small compared to g?°T, the theory is effectively reduced to a 3-dimensional gauge 
theory (in general, with scalar fields) with gauge coupling (10.65). Such a theory exhibits 
quite non-trivial nonperturbative properties. These have been explored in a number of 
lattice studies. 

Second, for the fields A;(x) independent of Euclidean time 7, the partition function 
(10.60) is 


e SF = | bajen, 


where 


H[Ai] = [ae (Frins aes ‘ ; 
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and dots denote terms involving possible scalar fields. (Here we have preserved the four- 
dimensional normalization of the vector field.) The expression on the right side is the 
partition function of classical four-dimensional gauge field theory (in general, with scalar 
fields) with the Hamiltonian H[A;]. Thus, our analysis leads to understanding [183, 184] 
that the behavior of quantum theory at high temperatures at length and time scales 
comparable to or in excess of (g?T')~', basically coincides with the behavior of classical 
field theory.’ This fact is useful for studying static, and especially dynamical (showing up 
in time evolution) properties of the theory at high temperatures. 


10.4. *Electroweak Vaccum at Zero Temperature 


This Section is a digression from the theory of the early Universe: we are going to 
discuss an intriguing property of the Standard Model at zero temperature. 

Classical (tree level) scalar potential is not, generally speaking, the right object 
to study when considering vacuum of the theory. The correct one is the zero temper- 
ature effective potential, which includes radiative corrections [189]. If one assumes 
that the effective potential for the Englert-Brout—Higgs field can be calculated 
within the Standard Model,!? then it is quite likely that it has a minimum at high 
field value ¢ >> 100 GeV, and that this minimum is deeper than “our” minimum 
ġġ = 246 GeV. This would mean that the Universe resides in false, metastable 
vacuum. 

The effective potential that incudes one-loop corrections has the same form as 
in (10.22), where the contributions of various particles are calculated in Section D.4 
(see Eqs. (D.38), (D.43)), 


. 3 
f(T = 0,8) = | EE yp mle), (10.66) 


where the positive (negative) sign is characteristic of bosons (fermions). Here gi 
denotes the number of internal degrees of freedom (each Dirac fermion has g; = 4). 
Integral in (10.66) is ultraviolet (UV) divergent. This is rather general situation in 
quantum field theory; the UV divergence is removed by renormalization of “bare” 
parameters in the Lagrangian; see, e.g., Refs. [268, 269, 271]. 

In the case at hand, we regularize the integral in (10.66) by cutting it off 
at |p| = Ayy, where Avy is cutoff parameter which will eventually be sent to 
infinity. This procedure is known as momentum cutoff regularization (we use its 


11 There is a subtlety associated with the need for ultraviolet cutoff in the classical field theory: 
for the Rayleigh—Jeans distribution, the energy density diverges in the ultraviolet region. 

12This is a very strong assumption, implying, in particular, that new physics, responsible for 
neutrino oscillations, dark matter and baryon asymmetry of the Universe, does not induce sizeable 
corrections to the parameters of the Standard Model even at high energies. 
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3-dimensional version). We find the regulatized contribution 


i Auv 
f= | Pdp yip + m6 (10.67a) 
0 
4 2 
i i(@) i(d) | 1 
= phy Ab + Abvini(o) + THE (m TRA 4. 5) + 00A); 


(10.67b) 


In renormalizable theories, the “bare” (tree level) potential is a fourth order poly- 
nomial 


VO) = pO + Pd + m2 OP? + LG + AMG. 


Parameters p, m?, \) (as well as et} are independent of ¢ and are bare cos- 
mological constant, bare mass parameter and bare self-coupling of the field ¢. There 


are no linear and cubic terms in the Standard Model, oh = cf) 


= 0. In renormal- 
izable theories, mass squared m?(@) in (10.67b) is a polynomial of at most second 
order in @¢, so the effective potential with onee-loop corrections has the following 


structure: 


VaVO4S = VO EVER EV, 


where 
4 2 
div _ 5 `+ Ji 4 2 2 mi (o) H 1 
= dp, + 8c : $ +m? - p? + 6c3- 62+ 6A- G4 (10.68b) 


is UV divergent part with parameters dp,,...,0A, depending on Ayy (the second 
and fourth terms in (10.68b) are again absent in the Standard Model) and the term 


Gi m?(¢) 
VO = eee: m3(¢) In - 


i 


is finite! as A — oo. Note that we introduced an arbitrary parameter p of dimension 
of mass. This is the renormalization scale, which enters also dp, 6m”, 6\ and 6c1,3. 
Physical quantities must be independent of u. 


13Note that finite (in the limit Ayy — oo) polynomial terms which have the structure (10.68b), 
can be reshuffled between V“" and V(®). This arbitrariness corresponds to arbitrary choice of the 
renormalization scheme. 
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We define the renormalized parameters 


m? (u) =m? + ôm? 


Alu) = AO + 8A 


(and similarly for pa and c1,3) and treat them as finite in the limit Ayy — oo (the 
bare parameters are divergent in this limit). This is precisely the renormalization 
of the one-loop effective potential. The renormalized effective potential 


V = pa (u) + cr(u)b + m(u)G? + c3(u)d? +A) + VO 


is finite as the regularization is removed. The parameters entering V“) are to be 
treated as renormalized: the difference between bare and renormalized parameters 
in V“) is the second loop effect which is neglected in one-loop approximation. 

We are interested in the behavior of the effective potential at large ¢. We have 


om! m2 
VA) = Audet + eA i 


(10.69) 


The explicit dependence on pu in this formula is compensated for by the depen- 
dence of self-coupling A(z). This is the simplest manifestation of the renormalization 
group. 

Since the particle masses are proportional to couplings to the scalar field, the 
second term in (10.69) leads to corrections to the effective potential which are 
proportional to powers of these couplings: the heavier the particle, the larger its 
couping to the scalar field and the larger its contribution to V(@). In the context 
of the Standard Model, the main contributions come from t-quark, Z- and W=- 
bosons and the Higgs boson itself. To the leading order in ¢, we find in the Landau 
gauge!4 


3M$(o) ,, M3(8)_, 6MA(d) Malo) _ 12mé(d) | m2) 


4 
= e ——_—_—>—_— ji m 
Vee 64n2 L? 64r? u? 647? j u? 
36A? 6AP? 12A? | 24? 
+—— | l 10. 
6472 pe? Gln? u’ e 


where particle masses depend on the scalar field as in (10.20). 


14Here we deviate from our analysis in Sec. 10.2 where we use the unitary gauge. This subtlety is 
not very important, however. 
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The last two terms in (10.70) come from the scalar field itself. Let us explain their origin. 
We decompose the Englert-Brout—Higgs field into background part ġ and quantum part: 


0 1 Ei + 1&2 
H = — = : 
Cees t 


The quadratic part of the kinetic term in the Englert-Brout-Higgs Lagrangian is 


3 
(DH)! DpH = 5 (Auk)? +Y 5 (Oxa)? — Bt Ân duo + 0,9'A,® + 814,48, (10.71) 


a=1 


where we write in somewhat sloppy way D, = Ou + Â, with anti-Hermitean Au: The last 
term in (10.71) is the mass term for W=- and Z-bosons. The Landau gauge O,A, =0 
is particularly convenient here, since the cross terms (the third and fourth in (10.71)) 
vanish upon integrating by parts, and the fields h and a have canonical kinetic terms. 
At large background field ¢, the tree level scalar potential, up to quadratic order in h and 
Eiis reads!® 


where 
malp) = 6A$", m, (¢) = 24”. 


We substitute these masses into the general formula (10.69), and obtain the second line in 
(10.70). 


The one loop effective potential (10.70) can be written as follows: 


¢ 


pe? 


, 3M3 + 6Miy — 12m4 + 3mģ 


4 
16724 oe 


V(9) = Xu) ¢* (10.72) 
where the correction is written in the leading logarithmic approximation, and the 
couplings are expressed through “our” particle masses. Because of the large t-quark 
mass, the second term here is negative, so the effective potential becomes negative 
at large ¢. Thus, within the one loop approximation, “our” vacuum ¢ = v/V2 is 
metastable, rather than absolutely stable. 

One loop approximation is, however, insufficient for deciding whether “our” 
vacuum is stable or metastable. Let us set u? = v?/2 in (10.72) and find that 
modulo small corrections \,,_,,, yg = ™mj,/2v? = 0.13. Then the one loop effective 
potential (10.72) becomes negative at ¢ = 10° GeV. For so large field values, the 
higher loop effects become important. These can be incorporated in the following 


15The term linear in h does not contribute to the action, since the homogeneous part of the scalar 
field is set equal to ©, and therefore f d*x h(x) = 0. 


10.4. * Electroweak Vaccum at Zero Temperature 309 


0.10 
0.08 | 30 bands in 
M, = 173.1 + 0.6 GeV (gray) 
a3(Mz) = 0.1184 + 0.0007(red) 
= 0.06 - Mp = 125.7 + 0.3 GeV (blue) 
an 1.0 r 
e ay 
= 0.04} Y 
8 ost > 
2 
El E 4 3 
E 0.02 2 0.6. 
E me T- o apiaaanaan 
ED z -a i = 
= 000 8 oal = 
= 
nm 
—0.02 + 025 HIN 
sasada ansdina s SSS ian min TeV ------------ E 
Sy. 
-0.04 £ 00 : 
10? 10* 106 108 10° 10? 104 106 108 10° 10 10 106 108 10 10? 10 106 108 107 
RGE scale y in GeV RGE scale y in GeV 


Fig. 10.13. Left: Evolution of running self-coupling of the Englert-Brout—Higgs field at three 
loop level in the Standard Model. Right: Evolution of the most relevant couplings in the Standard 
Model [190]. 


way. Let us choose u = ¢ in (10.72). Then the effective potential becomes 
V(d) = AS) 9". 


Written in this form, the effective potential is determined by the running self- 
coupling’® \(¢). In particular, the potential becomes negative if the self-coupling 
changes sign. 

The selfcoupling A(jz) together with other couplings obey the renormalization 
group equations. Higher order corrections in this formalism show up as higher order 
perturbative contributions to the renormalization group functions (beta-functions). 
The behavior of the running self-coupling A(j) at three loop level is shown in 
Fig. 10.13 (left). For central values of the parameters of the Standard Model, the 
scalar self-coupling becomes negative at ¢ > 10'! GeV. This suggests that “our” 
vacuum is metastable. The situation is not completely clear yet, however: Fig. 10.13 
shows that uncertainties in the determination of couplings at the electroweak scale 
allow for the absolute stability of “our” vacuum at 30 C.L. If at some ¢, << Mp; the 
potential is indeed negative, then the cosmological scenario should be such that the 
field never reaches ¢,,, otherwise the field would stay in the “non-standard” vacuum. 
In particular, in inflationary scenario every scalar field takes values of the order of 
the inflationary Hubble parameter Hint. Therefore, the inflationary dynamics must 
be such that Hing < ox. 

Note that neglecting gravity, the self-coupling flattens out at large ọ ~ 
10°—10?? GeV, while other couplings do not; see Fig. 10.13 (right). In fact, the 
low energy values of the Standard Model couplings, in particular, t-quark Yukawa 


16There is a subtlety here which has to do with the anomalous dimension of the field ¢. This is 
not important for our purposes. 
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Fig. 10.14. Left: Regions of strong coupling, stability, instability and metastability of the Standard 
Model vacuum. Black dots with numbers show the field values at which the scalar self-coupling, and 
hence the effective potential, becomes negative. Right: Rectangle shown in the left part. Ellipses 
surround the region favored by the current data on the Standard Model parameters at 10, 20, 30 
C.L. [190]. 


coupling and Higgs self-coupling, are rather special. Namely, the electroweak vac- 
uum is likely false but long lived, while couplings remain small up to the Planck 
energy; see Fig. 10.14. 

Since it is likely that the elctroweak vacuum ¢ = v/v2 is metastable, it is 
of interest to estimate its lifetime. The false vacuum decay occurs through the 
spontaneous nucleation of bubbles, which at zero temperature is a tunneling process. 
The nucleation probability per unit time per unti volume has the following form: 


dP e- Slee] 
= dtdV ` RE ’ 
where S[R.] is the Euclidean action on classical Euclidean configuration in four 


dimensions, and Re is the size of this configuration; see details in [175]. The relevant 
configuration is an extremum of the Euclidean action 


Sp= | ate [0+ vO], 


where V(¢) is the effective potential, summation over js is done with Euclidean 
metric. This extremum obeys the Euclidean field equation 

OV 
Oo 


(10.73) 


—28,,0.6 + 0 (10.74) 


with the boundary condition 


o(|z| > œ) = ¢_, 


where ġ— is the field value in the false vacuum. 
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In the Standard Model we deal with very large fields, so we neglect the mass term 
of the field ¢. Also, we neglect weak (logarithmic) dependence of the self-coupling 
on @, and set 


V(¢) =—lA\¢" (10.75) 
with A = \(u) = const. Then Eq. (10.74) takes simple form 
ô ð p + 2|A|b? = 0, (10.76) 


and the boundary condition is (|x| — oo) = 0. It follows from Eq. (10.76) that the 
solution depends on À as ¢ œ |A|~!/?, so the action on the solution is S = const/|A\. 
To find the constant here, we solve Eq. (10.76). The solution is [188] 


2 Re 


$= Ty a+ Be’ 


(10.77) 


where z? = aa", Re is an arbitrary parameter, and the self-coupling A is to be 
computed at the scale u ~ R7 +. The Euclidean action on this solution is 
81? 
SIR] = —. 10.78 
(Rel = S (10.78) 


Had we neglected the dependence of A on energy scale, the theory with the poten- 
tial (10.75) would be scale-invariant, and this is precisely the reason for the appear- 
ance of the arbitrary parameter Re. To find the value of this parameter, we search 
for the maximum of the tunneling probability (10.73), and since the dependence of 
the tunneling exponent on self-coupling is strong, this means that we have to find 
the maximum of |A|, which is determined by the equation d\/du(u = Rz') = 0. 

Figure 10.13 shows that |A| is small for negative À, so the tunneling probability 
is strongly suppressed. To estimate the probability of the bubble nucleation in our 
part of the Universe, we recall that the size and lifetime of this part are of order 
H} ', so the relevant 4-dimensional volume is of order Q4(to) ~ Ho *. The bubble 
size Re is about thirty times smaller than the Planck length; see Fig. 10.13. In this 
way we get 


3-10!” GeV 
Ho 


and for |A| = 2-107? (see Fig. 10.13) we have P ~ 107340, The electroweak vacuum 
did not have time to decay. 


P(to) =T : Qa (to) ~ ( ) om, (10.79) 


Problem 10.9. Refine the estimate of the four-volume Q4(to) in (10.79). 


Let us look into the future. For t >> to the estimate for the tunneling time ttunn 
depends on the properties of dark energy. This time is greater if dark energy is 
the cosmological constant as compared to the Universe with decaying dark energy, 
which eventually becomes dominated by dark matter; see Fig. 10.15. Indeed, the 
relevant spatial scale in the Universe dominated by the cosmological constant is the 
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Fig. 10.15. Lifetime of the false electroweak vacuum, as a function of t-quark mass, in the Universe 
with the cosmological constant (right curve) and the Universe which is matter dominated in far 
future [190]. Vertical strip shows the experimental uncertainty in m+. 


event horizon size i = Ho 19, 2 Ay 1. Therefore, the tunneling time for this 
Universe is estimated from 


tann l e Trel (10.80) 
Re (HoR)? l i 


If the Universe is matter dominated in far future, then the relevant spatial scale is 
the particle horizon size H~+(t) ~ t, and the tunneling time obeys 


pM) \4 a2 
umn |} oe SIT ~ 1. (10.81) 


C 


By comparing (10.80) with (10.81) we see that t®), > ¢ indeed. 


Chapter 11 


Generation of Baryon Asymmetry 


As we discussed in Secs. 1.3 and 5.2, there are baryons and no antibaryons in the 
present Universe.! The present baryon abundance is characterized by the baryon- 
to-photon ratio 75 = Nz,0/N,0 ~ 6.05 - 10719. Another way to quantify the baryon 
asymmetry is to use the baryon-to-entropy ratio (see Sec. 5.2) 


Ap = a ~ 0.86 - 10729, (11.1) 
This ratio stays constant at the hot stage of the cosmological evolution provided 
there are no processes that violate baryon number or produce large entropy. One of 
the problems for cosmology is to explain the baryon asymmetry [34, 35]. According 
to our discussion in Secs. 1.5.5 and 5.2, for the initial state it is natural to assume 
baryon-symmetry. 

Let us stress right away that there is no unambiguous answer to the question of 
the origin of the baryon asymmetry. It could be produced either at hot stage or at 
earlier post-inflationary reheating epoch. We discuss reheating in the accompany- 
ing book, and here we consider several mechanisms that could work at hot stage. 
Before discussing concrete mechanisms, let us study general conditions necessary 
for baryogenesis. 


Besides the baryon asymmetry, there can be, and most likely there is, lepton asymmetry. If 
it is not large, its value does not seem to be possible to measure. Lepton asymmetry resides 
today in the excess of neutrinos over antineutrinos, or vice versa (electron number density 
equals that of protons by electric neutrality), while measuring relic neutrino abundance 
will be impossible in the foreseeable future. 


1Modulo exotica like possible existence of compact objects made of antimatter. The latter are 
predicted in some models [192] and do not contradict observational data [193]. 
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11.1. Necessary Conditions for Baryogenesis 


Baryon asymmetry generation at a certain cosmological epoch is possible only if 
three conditions are met. These are dubbed Sakharov conditions. Namely, three 
properties must hold simultaneously: 


(1) Baryon number non-conservation (with qualification, see below) 
(2) C- and CP-violation. 
(3) Thermal inequilibrium. 


The fact that the first condition is necessary to produce asymmetry is self-evident. 
Now, if C- or CP-invariance is exact, processes with quarks and leptons occur in 
the same way as processes with their antiparticles, and no asymmetry is generated. 
Hence the second condition. 


The latter result follows, at formal level, from the evolution law of the density matrix, 
p(t) = oO) p(t jet, (11.2) 


where H is the Hamiltonian of the system, t; is initial moment of time. C- or C’P-invariance 
means that the corresponding unitary operator Uc or Ucp commutes with the Hamilto- 
nian. Say, for C P-invariance 


UcpHUgp = Å. 
This gives 
Ucrp(tUcp = p(t), 


provided that the initial state is symmetric, i.e., Ucrp(ti)Ugp = p(ti). Together with the 
fact that the baryon number operator is C P-odd, Ucp BUcp = —B, the latter property 
gives 


(B) = Tr[Bo(t)] = 0. 


The medium remains baryon symmetric. 


Finally, the third condition is also fairly obvious. In thermal equilibrium with 
respect to baryon number violating reactions, the system is in a state with zero 
baryon chemical potential, i.e., zero baryon number density. Baryon asymmetry 
tends to get washed out, rather than generated, as the system approaches thermal 
equilibrium. (There are, however, special cases in which this rule is violated; we 
discuss an example later in this Section.) 

The latter conclusion needs qualification. It would be literally valid if the baryon 
number were the only relevant quantum number. We will see that lepton numbers 
are also important in the Standard Model of particle physics: the baryon number 
itself is not conserved at temperatures above T ~ 100 GeV, while its linear com- 
binations with lepton numbers are. In particular, conserved quantum number is 
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(B-— L) where L = Le + L, +L, is the total lepton number. In thermal equilibrium 
at nonzero (B — L) both baryon and lepton numbers do not vanish, 


B=0. (B-L), L=(C-1)-(B-L), (11.3) 


where the constant C is of order 1 (but somewhat less than 1). Thus, the baryon 
number does not vanish. This property is used in leptogenesis mechanisms: they 
produce lepton asymmetry (and hence (B — L)) at very high temperature by pro- 
cesses beyond the Standard Model, and then this asymmetry is partially reprocessed 
into baryon asymmetry by electroweak physics. 

Each of the three Sakharov conditions, generally speaking, is associated with 
one or another small parameter. Since all three should work at the same time, the 
net baryon asymmetry is naturally small in most models. We will see how these 
conditions are fulfilled in concrete mechanisms. 


The third Sakharov condition should not necessarily hold at the time the asymmetry is 
produced, although it must be satisfied at some cosmological epoch amyway. Namely, 
dynamics may be such that the equilibrium value of a global charge (B, L, (B — L), etc.) 
is nonzero for some time. This charge is generated if processes that violate this charge, as 
well as processes violating C and CP are in thermal equilibrium. 

To illustrate this idea, let us suppose that there is interaction of the global (say, bary- 
onic) current J with non-stationary background field ©, 


Lint = jn O-O. (11.4) 


This interaction breaks CP, and in the case of homogeneous but time-dependent ©® it 
breaks also invariance under time reversal (T-invariance), so CPT is also broken. This 
induces disbalance between particles and antiparticles: the interaction term (11.4) make 
the following contribution to the Hamiltonian density 


Hint = —Ënpg. (11.5) 


In thermal equilibrium with respect to baryon number and C’P-violating processes there 
is asymmetry 

VET NB: x; è (11.6) 

ngs +ng T i 
Indeed, having the term (11.5) is equivalent to adding baryon chemical potential ug = È. 

If ® itself depends on time, the equilibrium asymmetry also varies in time. It freezes 
out at temperature Tg, when baryon number violating processes switch off. Equations 
(5.22) and (11.6) give for the resulting asymmetry 
Aps nB-NB (Te) 
sS Tg 

This mechanism of the baryon asymmetry generation is called spontaneous (or gravita- 
tional) baryogenesis [215, 216]. 


To end this Section, let us see that irrespective of the baryogenesis, the very 
fact that baryon asymmetry exists implies that there is no relic antimatter in the 
Universe. 
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To estimate the relic abundance of antibaryons, let us write the Boltzmann 
equation for antibaryon number density ng: 


d(ng a?) 
dt 


= — (Cann) * (nenga? — niina’), (11.7) 


where Gann is the annihilation cross-section. (We neglect the difference between the 
annihilation cross sections of protons and neutrons.) In the non-relativistic situation 
we have Gann = go/v (see Sec. 9.3), where 


oo ~ 1 (Fermi)? = 10776 cm? ~ 25 GeV”. (11.8) 
Equation (11.7) is obtained in the same way as Eq. (9.7) for dark matter particles. 


Problem 11.1. Derive Eq. (11.7) by making use of eq. (5.59). Hint: (oannv):Ne = 


Tan, is the annihilation probability per one antibaryon per unit time (i.e., Tann is 


the lifetime of an antibaryon in the cosmic medium). Make use of Eq. (5.51) and 
the fact that the baryon density greatly exceeds the antibaryon density. 


When the annihilation rate exdceeds the cosmological expansion rate, the 
antibaryon abundance is nearly equilibrium. This situation occurs at high enough 
temperature, as we will soon see. Chemical equilibrium breaks down when the anni- 
hilation and creation rates become insufficient to keep chemical equilibrium. The 
change of regimes occurs when (cf. Eq. (9.14)) 


oe (Cannv) : nina’. (11.9) 


d(n%a?) 
dt 


At about this time the antibaryon abundance gets frozen out. 

It remains to find the freeze-out temperature and the equilibrium antibaryon 
density at this temperature. Since chemical potentials of particles and antiparticles 
differ only by sign in thermal equilibrium, we write at T < mp (we neglect the 
difference between proton and neutron masses and do not write spin factors), 


3/2 3/2 
a Tiot _™p7HB é Mpl _ mptup 
nid = ( - ) e T 7 n% = ( i ) e T , (11.10) 


Note that at T < m, the baryon density n% is not exponentially small only for 
Us = Mp + O(T). Using ng = ns - ny, we find from (11.10) 


1 T\? _2mp m3 mp 
n= (Z ) ere n e et : (11.11) 


The right-hand side rapidly changes with temperature, so 


eq 
3 dng 


dt 


d(nga°) 
dt 
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We now use |T/T| = H(T) = T?/M#, and obtain from (11.11) that 


dn? m 
3 B 3''"p 
re wo” A -= nS, 


The freeze out relation (11.9) takes the form 


~ (Cannv) Np ~ Tons T?. 


In this way we obtain freeze out temperature, 


= 1/2 
T~ eS} ~ 10 keV. 11.12 

(aE “Ip ` 90 l ) 
The antibaryon abundance is fantastically small at this temperature: we find 
from (11.11) 


na ~ 1071, (11.13) 
no matter in which units. The visible Universe contains no single relic antibaryon. 


Clearly, one naturally questions the validity of statistical physics methods, which we used 
in our calulation, for the system with so small number of antibaryons. Still, there is no 
doubt that all antibaryons annihilate away in baryon-asymmetric Universe. 


Problem 11.2. Solve the Boltzmann equation (11.7) in quadratures. Compute the 
integral by saddle point method and find the temperature that gives the largest 
contribution into the present value of ng, thus refining the estimate (11.12). Find 
the present value of ng. Is it consistent with (11.13)? 


Problem 11.3. Find the relic positron abundance at the present epoch. 


11.2. Baryon and Lepton Number Violation in Particle 
Interactions 


In this Section we discuss two mechanisms of baryon number violation. One indeed 
works in Nature (provided that the temperature in the Universe much exceeded 
100 GeV), since it exists already in the Standard Model [194]. Another is inherent 
in Grand Unified Theories, GUTs (see, e.g., Ref. [195]). Even though there is no 
direct experimental evidence for Grand Unification, this hypothesis is very plausible. 
Finally, we will see in what follows that lepton number violation is also relevant for 
baryogenesis; we discuss one of the mechanisms of this violation at the end of this 
Section. 
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11.2.1. Electroweak mechanism 


Baryon and lepton number violation are non-perturbative in the Standard Model; 
they are not visible in Feynman diagrams. We will only give here sketch of what is 
going on; interested reader may find details in Ref. [175] or in appropriate reviews. 

The phenomenon we are going to discuss is called ’t Hooft effect [138]; it is 
due to gauge interactions corresponding to the subgroup SU (2)w of the Standard 
Model group SU(3). x SU(2)w x Uy (see Appendix B). These interactions involve 
left quarks and leptons. The classical Lagrangian is invariant under common phase 
rotations of all quark fields; this symmetry would give rise to baryon number con- 
servation. It is also invariant under phase rotations of leptons of each generation 
separately (we neglect neutrino masses and mixing at this point); these would corre- 
spond to the conservation of three lepton numbers Ln, n = e, u, T, see Appendix B. 
However, the corresponding currents j/7, jj," are anomalous at the quantum level, 


. g’ va a 
o"j? = a ve, (11.14) 
arji = Te vapa, #2125, (11.15) 


where Vi, = ôu Vs — OVP + ge VPV is the field strength of the SU (2)w gauge 
field, ve = = $€yvrpV?" is the dual tensor, g is SU(2)w gauge coupling. The reason 
for his anowaly is that left and right fermions interact with the field V; in different 
ways? (in fact, right fermions do not interact with V;? at all). 

Equations (11.14) and (11.15) show that baryon and lepton numbers are not 
conserved,® if there emerge non-zero gauge fields in vacuo or in medium, 


AB = B(t;) — a= at f axa = a dx rh veye, fiti) 
where t; and ty denote initial and final time. Similar relation is valid for se of the 
lepton numbers. Baryon number violation requires strong fields, V% x z making 
the integral (11.16) different from zero (this integral takes sheer values only). 
Energy of very strong fields is proportional to +. Thus, we conclude that baryon 
and lepton number violation occur when the system overcomes energy barrier; see 


Fig. 11.1. The estimate for its height is 


Mw 


Eph ~ —— 
P 2°? 
g 


2In QCD, left and right quarks interact with gluon field in the same way, and baryon number is 
conserved in strong interactions. 

3We do not discuss the actual physics leading to non-conservation; see in this regard [175] and 
references therein. 
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n=-l n=0 n=1 n=2 {V9} 

Fig. 11.1. Schematic plot of static energy as functional of classical gauge and Higgs fields. The 
horizontal line corresponds to (infinite-dimensional) space of all field configurations {V, ¢}. Abso- 
lute minima labeled by an integer n = 0,+1,+2,... are pure gauge configurations of zero energy, 
in which the fields V(x), ¢(x) have different topological properties (topologically distinct vacua). 
The integral in (11.16) is equal to 1 for the fields that evolve from the vacuum with topological 
number n to the vacuum (n+ 1) as time runs from t; to ty. This integral is zero for fields evolving 
near one vacuum. The “maximum” with energy Espn is in fact a saddle point: energy decreases 
along one direction in the configuration space and increases along all other directions. 


where the factor My is inserted on dimensional grounds.* The calculation of the 
barrier height (sphaleron energy) gives [196] 


2My (mar 
Esn = €B ( +), 
Ph aw (7) 


where 


the function B(mp/Mw) takes values from B = 1.56 at mp/My <1 to B = 2.72 
at mp/My > 1, and B = 2.4 for mp = 125 GeV. Thus, the barrier height in the 
Standard Model is about 11 TeV. 

At zero temperature and zero fermion density the transmission through the 
energy barrier is only possible via quantum tunneling. This tunneling is described 
by instanton [197]. The tunneling probability is exponentially small, 

Ta ear, 
Since ay ~ 1/30, the suppression factor is extremely small, r œ 107165, Baryon 
number violating processes practically do not occur in usual circumstances. 


4The reason for notation Espn is as follows. The height of energy barrier is equal to energy of saddle 
point configuration, which extremizes the static energy. This configuration is called sphaleron, from 
Greek og@aAepov, unreliable, ready to fall. 
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Another situation occurs at finite temperatures [194]. In that case, the barrier 
may be overcome via thermal jumps on its top (better to say, on the saddle point). 
At relatively low temperatures a naive estimate for the suppression factor in the 
rate is given by the Boltzmann formula for configuration of energy Espn, 

Esph 
D spn xe Tr 
This estimate, however, is incorrect at interesting temperatures when the suppres- 
sion factor is not extremely small. The probability of the realization of a given 
configuration is determined by its free energy, rather than energy. In our case the 
main effect is that the Englert-Brout—Higgs expectation value, and hence Mw, 
depends on temperature; see Chapter 10. So, the better estimate is 


4 _ Esph(T) 
Depr = C- Tte T, (11.17) 
where 
2Mw(T) (mn 
Fop T) =R —— ], 11.18 
p(t) = D p (e) (11.18) 


and one can use zero-temperature masses in the argument of the function B. (In fact, 
the latter statement is not quite correct, but we will not need further refinement, in 
view of weak dependence of B on its argument.) IP sph is the over-barrier transition 
rate per unit time per unit volume, so we have written the factor T4 in (11.17) on 
dimensional grounds. The pre-exponential factor C is dimensionless; it depends on 
temperature, the Englert-Brout—Higgs expectation value and couplings. The most 
important in (11.17) is the exponential factor, so we will set 


C~wl 


in our estimates. 

Thus, at low enough temperatures the rate of electroweak baryon number viola- 
tion is given by (11.17). This formula does not work, however, at high temperatures, 
when Fsph < T and the exponential suppression is absent. The latter situation 
occurs in unbroken phase,” when (¢) = 0 and hence Mw(T) = 0. In that situation 
one makes use of the estimate (modulo logarithm) 


i “=o, (11.19) 


where x’ is a numerical coefficient. This estimate follows, to a certain extent, 
from the results of Sec. 10.3. We have seen there that at high temperatures and 
Mw(T) =0 the theory possesses the non-perturbative parameter g? = g?T. The 
sphaleron transition rate is mostly determined by this parameter; using it, we would 
obtain dimensional estimate I sph ~ g2 ~ (awT)*. The additional factor œw is due 


5Our terminology here is standard, though not quite appropriate, see discussion in Sec. 10. 
gy g pprop 
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to specific plasma effects [198]. The coefficient x’ has been found by lattice simula- 
tions and it turned out to be rather large [199], 


Note that if one substitutes the non-perturbative scale g? into (11.18) instead of 
Mw(T), the sphaleron free energy becomes of the order of temperature, so the 
Boltzmann suppression is indeed absent. 

Let us make use of the estimates (11.17) and (11.19) to find the range of tem- 
peratures in which electroweak baryon number violating processes are in thermal 
equilibrium in the early Universe. This is the case when a particle participates in 
at least one of these processes per Hubble time, i.e., Pspn Z nH, where n ~ T? is 
the number density of particles of a given type. Thus, the condition for equilibrium 
with respect to the sphaleron processes is 


ees 7 
bh > H(T) = (11.20) 
T3 M3, 
At high temperatures, we use (11.19) and find 
T < Toph ~ 10"? GeV. (11.21) 


At relatively low temperatures the condition (11.20) gives 


Making use of the crude estimate T ~ 100 GeV in the argument of logarithm, we 
find numerically 


My(T) 0.66 
— < —— © 0.28. 11.22 
T ~“ B(ma/Mw) ( ) 
Thus, the sphaleron transitions switch off after electroweak transition only, i.e., 
at T < 100 GeV. We conclude that electroweak baryon number violation is in 


thermal equilibrium in wide temperature interval extending from 101? GeV tof 
about 100 GeV. 


6Numerical calculation shows that the expansion rate starts to dominate the sphaleron rate in the 
Standard Model at T ~ 132 GeV [199]. 
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Let us now discuss selection rules for electroweak baryon number violating pro- 
cesses. They follow from the relations (11.14) and (11.15) and have the form 


AB = 3AL, = 3AL, = 3AL,. 


In other words, there are three conserved combinations of baryon and lepton num- 
bers, which can be chosen as follows, 


(B-L), (Le-Ly), (Le - Lr). (11.23) 


At temperatures 100 GeV < T < 10!” GeV these numbers may be non-vanishing, 
while baryon and lepton numbers themselves are adjusted in such a way that the 
Grand canonical potential is at its minimum. 


Let us find baryon and lepton number densities at temperatures above the electroweak 
transition but below (11.21) at given (B—L) number density, assuming that the three lep- 
ton number densities are equal to each other. Let us make the calculation for vf fermionic 
generations and vs Englert—Brout—Higgs doublets. Quantum numbers of all particles are 
given in Appendix B. Above the electroweak transition, it is convenient to work in terms 
of the scalar doublets with components h* and h°, their antiparticles and gauge bosons of 
two polarizations. To calculate particle number densities, we introduce chemical potentials 
to all conserved quantum numbers. Relevant quantum numbers are (B — L) and weak 
hypercharge Y (see Problem 5.4). Then particles of type J have chemical potential 


Y 
ur = (Br — Lr) + TE 
while for antiparticles u7 = — yur. Here Br, Lr and Yr are baryon number, lepton number 


and weak hypercharge of particle J, u and uy are chemical potentials to (B — L) and Y/2. 
As an example, for left electron and neutrino we write 


il 
Hv = He, = =H- 3HY, 


while for charged and neutral scalars (components of one of the Englert-Brout—Higgs 
doublets) we have 


1 
Hh+ = Hpo = aly: 
etc. We make use of the result of Problem 5.4 and write for asymmetry of fermions of type 
F of all generations 


1 T 
Np —np = Anp = art : Me» 


while the asymmetry of scalars of type H is 


2 
Ny — ny = Any = Vs ` Hu ` —. 


3 
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(We assume here that asymmetries are small, so that ur << T.) This gives 


Anp p+ + Ango = Vs + by + = 

1 T? 

An, + Ani, = vy (5n — n) -— 
1 

Amp = Zvsl-uv =p): 


1 1 2 

Anu, + Ana, = vf (Fu + Zu) -3 — 
1 2 1 T2 

Anup = Spr | =y += Ban, 

Nur gs (Fu +u) 3 


1 1 1 T 
Anap = gt (5u F Zu) “3: 3° 
(11.24) 


where the factor 3 in the last three formulas accounts for quark colors. Gauge bosons carry 
zero baryon number, lepton number and weak hypercharge, so there is no asymmetry in 
them. 

The system is neutral with respect to all gauge charges, including weak hypercharge 
(this is the analog of electric neutrality of the usual plasma), therefore 


5 Yr $ Anr = 0. 
I 
Making use of (11.24) we find 
5 4 1 
> = =Vs° =0. 11.25 
Vy [Bu + Su] +i HY ( ) 


This explains why we introduced the chemical potential uy: had we set uy = 0 in the 
beginning, the medium would not be neutral with respect to weak hypercharge at u Æ 0. 

We now eliminate one of the chemical potentials, say, 4, by making use of Eq. (11.25) 
and express all asymmetries (11.24) in terms of the only remaining chemical potential. In 
this way we obtain for baryon number density, which we denote simply by B, 


1 T? fi 1 
B= 3 (Anus + Ana, + Anup + Anag) = -= gut t gus Ly, (11.26) 


while the lepton asymmetry is 
oe a Loy 
=a e ig e 
Thus, the value of (B — L) is related to uy by 


Let us again use (11.26) and obtain finally 


8Vf + 4vs 
E 22vf + 13vs 


. (B-L). (11.27) 
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This gives the constant C entering (11.3). There are three generations of fermions which are 
effectively massless at electroweak temperature, so in the theory with one scalar doublet 
we find 


87 + 4vs 28 


ape ye 11.28 
Davy, + ay, T 3 =) =e te) 


We stress that this value, as well as the whole analysis, is valid above the electroweak 
transition only; at the tarnsition epoch and later, the parameter C depends on temperatue 
and is somewhat (but not dramatically) different from (11.28); see, e.g., [200]. 


Problem 11.4. Considering temperatures above electroweak transition, introduce, besides 
u and uy, the chemical potential u3 to the diagonal (third) component T? of weak isospin. 
Find asymmetries in all particles including vector bosons and show that the requirement of 
neutrality with respect to the weak isospin is equivalent to u3 =0 at any u and py. Note: 
This result is not valid below electroweak transition temperature. 


Problem 11.5. For temperatures above electroweak transition, consider general situation 
in which the densities of all conserved quantum numbers (11.23) are nonzero. Show that 
the baryon asymmetry is still given by (11.27). 


11.2.2. Baryon number violation in Grand Unified Theories 


Another mechanism of baryon and lepton number violation exists in Grand Uni- 
fied Theories (GUTs). These contain new superheavy particles, vectors and scalars 
(and also fermions in supersymmetric theories) whose interactions with the Stan- 
dard Model particles violate baryon and lepton numbers already at the level of 
perturbation theory. As an example, there is a vertex shown in Fig. 11.2(a). This 
vertex describes the interaction of vector boson V with two quarks (unlike the 
gauge vertices of the Standard Model which contain both quark and antiquark). 
The existence of this vertex does not yet mean that baryon number is violated: 
were this vertex the only one, we could assign baryon number 2/3 to the boson V, 
and baryon number would still be conserved. However, there is also a vertex shown 


(a) (b) 


Fig. 11.2. Vector boson interaction with quarks q, antiquarks g and antileptons l. 
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Fig. 11.3. V-boson exchange leading to baryon number violation. 


=>> p —> etr? 


Fig. 11.4. Proton decay diagram. 


in Fig. 11.2(b) (we will explain shortly why it involves antilepton and not lepton). 
Therefore, baryon number is indeed violated: V-boson exchange leads to the process 
shown in Fig. 11.3. 

Interactions of Fig. 11.3 give rise to proton decay, see Fig. 11.4. There are very 
strong experimental bounds on the proton lifetime: depending on the decay mode 


Tp > 10%? — 10°? yrs. (11.29) 


This leads to strong bounds on baryon number violating interactions. The dia- 
gram 11.4 implies the following estimate for the proton width, 


1 a? 
r,=—~ Em 
P g p 
Tp Mọ 
2 
where ay = 4, gy is the coupling constant in each vertex; the factor M7? in 


amplitude, and hence M7‘ in the width, comes from the V-boson propagator, and 
the factor mý is introduced on dimensional grounds. Together with (11.29) this gives 
the bound 


My 2 10° GeV. (11.30) 


Thus, we are dealing with interactions that could be of interest for cosmology only 
if the temperatures in the Universe were as high as the GUT energy scale 


Meur ~ 101° GeV. (11,81) 


If the theory does not involve new fermions beyond the Standard Model ones, 
then the baryon number violating interactions are indeed given only by diagrams of 
the types shown in Fig. 11.2, with the diagram 11.2(b) containing precisely antilep- 
ton. This is true both for new vector boson V and possible new scalar boson S. 
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Q| 


AL 


Fig. 11.5. (B — L)-violating vertex. 


This follows from the invariance under the Standard Model gauge group; we will 
see that in the end of this Section. This result is important, since all these inter- 
actions conserve (B — L) if we assign (B — L) = 2 to vectors V and scalars S. 
Hence, baryon asymmetry cannot be due to these interactions only: they would not 
generate non-vanishing (B — L), and electroweak processes studied in the previ- 
ous Section would wash the baryon asymmetry out. We note that similar situation 
occurs in supersymmetric models as well. 

If there are new fermions in the theory, (B — L) may be violated simultaneously 
with B. A simple example is given by a theory with extra fermion A, which is 
neutral with respect to the Standard Model gauge interactions and whose baryon 
and lepton numbers are zero. Then the theory may contain both vertices shown in 
Fig. 11.2 and vertex of Fig. 11.5. Final states in Fig. 11.2 have (B — L) = 2, while 
the final state in Fig. 11.5 has (B — L) = —4, so (B — L) is not conserved indeed. 
Another possibility is that A, is lepton, i.e., its lepton number is L = 1. 


The reason for the existence of interactions shown in Fig. 11.2 in GUTs is as follows. Grand 
Unification of strong and electroweak interactions assumes that at ultra-high energies, 
all these gauge interactions are one and the same force. In other words, the Standard 
Model gauge group SU(3). x SU(2)w x Uy is a subgroup of simple gauge group G, and 
known fermions combine (possibly together with new fermions) into representations of 
G. This immediately implies that a multiplet containing the lepton doublets (as well as 
possibly different multiplet containing right charged lepton) must also contain fermions 
that transform non-trivially under SU(3)c, i.e., carry color. If one does not introduce many 
new fermions, the color partners of leptons are identified with quarks (or antiquarks). 
Now, since quarks and leptons are in one gauge multiplet, there must be a gauge boson 
interacting as shown in Fig. 11.2(b). A multiplet with colored SU(2)w-doublets may also 
contain colored SU(2)w-singlets. In other words, a multiplet under group G with left 
quark doublets may contain also SU(2)w-singlet colored fermions, which should also be 
left. These are left anti-quarks. So, there may exist multiplets of the full gauge group G 
that contain both quarks and antiquarks. This leads to interaction shown in Fig. 11.2(a). 

The simplest, but probably unrealistic example is given by the theory with gauge 
group G = SU(5) [201], whose algebra includes the Standard Model gauge algebras in the 
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following way, 


SU(3)e : ee Daxa ) 


02x3 02x2 
(11.32) 
03x3 03x2 
SU (2 : ; 
(2)w cs a 
U(i)y:Y = Špa, =- - diag(2, 2, 2, —3, —3). (11.33) 


3 2/15 


(Subscripts in (11.32) denote dimensions of matrices; Omxn is matrix with zero entries.) 
New fermions are not needed to complete SU(5) multiplets, while all Standard Model 
fermions of one generation are comfortably placed in representations 5 (antifundamental 
representation of SU(5)) and 10 (antisymmetric representation of SU(5)): 


Ez C c C — 
5= ( L1, di2, A13, €r, Ve), 


0 uss us? out dt 
—us3 0 ust uz dz 
10=| u$? -utt 0 ue Ë|, 
—ut =u? uF 0 el 
di, di, dg -ef 0 


where superscript refers to color and uf and d{, denote left antiquark fields (antitriplets 
under SU(3)- and singlets under SU (2)w ). 


Problem 11.6. Show that all fermions have correct quantum numbers with respect to 
the Standard Model gauge group SU (3)e x SU(2)w x Uy embedded in SU(5) according to 
(11.32), (11.33). 


Gauge bosons, as always, belong to adjoint representation of SU(5) (24-plet). Besides 
the Standard Model gauge bosons corresponding to the algebras (11.32), (11.33), there are 
12 extra gauge bosons whose fields are embedded in SU(5) algebra as follows, 


o 0 oO Vi UL 
c 0 0 vV U} 
0 oo 0 VW U 
Vo V Vyve O g 
uy U Ur © 0 


Here every field Uji, Vg is complex and describes two vector bosons. Their interactions 
with quarks and leptons have precisely the form shown in Fig. 11.2. 


Problem 11.7. Write all terms in the Lagrangian describing the interactions of V,- 
and U„-bosons with quarks and leptons. Hint: To simplify notations, notice that (Va, Up) 
together form SU (2)w -doublet and that they are SU(3)--triplets. 
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Fig. 11.6. Sketch of the evolution of the Standard Model gauge couplings with momentum transfer 
2 f 


2 
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in the supersymmetric extension of the Standard Model. Here a3 Qs 
2 
where gs and g are gauge couplings of SU(3)- and SU(2)w; a1 = 5, where g’ is gauge coupling 
of U(1)y; the factor 3 is due to the fact that the normalization of the generator U(1)y is different 
from the standard normalization of the generators of GUT gauge group. (As an example, trace of 


the generator T?4 squared in SU(5) equals the standard value 1/2, while trace of Y? equals 5/6, 
see (11.33); hence, gauge couplings of SU(5) and U(1)y are related by gsu(5) = (39') 


An? 


The energy scale (11.31) emerges in GUTs in a natural way [202]. As we pointed out 
in Sec. 9.4.2, couplings logarithmically depend on energy (more precisely, on momentum 
transfer Q). The gauge couplings of the groups SU(3)-, SU(2)w and U(1)y are very 
different at low energies, but as the energy increases, they get closer, as schematically 
shown in Fig. 11.6. In supersymmetric extension of the Standard Model (but not in the 
Standard Model itself), all three couplings become equal to acur % 1/25 at Q = Maur ~ 
10'° GeV [131]. This is precisely what should happen in a theory with a single gauge 
interaction above energy Maur with single coupling. Gauge coupling unification is a very 
strong argument both for Grand Unification and for the supersymmetric extension of the 
Standard Model at relatively low energies. 

Let us continue our general discussion and find the constraints that the Standard 
Model gauge symmetries impose on the structure of baryon number violating interactions. 
Let us assume for the time being that there are no new fermions. It is convenient in this 
Section to treat all fermions as left, i.e., instead of right quark and lepton fields Ur, Dr 
and Er consider left fields denoted by Uz, DE and EY and describing left antiquarks and 
antileptons. These are singlets under SU(2)w, antitriplets (Uz, DZ) and singlets (Æ) 
under color SU (3)e; their weak hypercharges are opposite in sign to hypercharges of the 
respective particles, i.e, 


4 2 
Yue = T3 Ype = 3 Yre = 2. 
Let us also recall the weak hypercharges of left doublets, 
1 
Yo, = 3 Yr, =-l. 


In terms of left fields, the only Lorentz-invariant renormalizable interaction with vector 
field has the Lorentz structure 


priy” Vues x Vipers, (11.34) 
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while Yukawa interaction with scalar field S can only have the form (compare with 
Majorana mass term for neutrino, Appendix C) 


BiShr x Shipy or YrSwr x Spip, (11.35) 


where i, qj are various Standard Model fermion fields. We stress that (11.34) contains 
both % and %, while (11.35) contains only 7 or only %. 

Interactions (11.34), (11.35) must be invariant under all gauge symmetries of the 
Standard Model. Let us begin with color SU(3)¢. If there exists vertex of Fig. 11.2(a), then 
V is either color antitriplet or sextet (since SU(3)- representations obey 3 x 3 = 6 + 8). 
SU (3)e forbids other vertices in the sextet case, so this case is not interesting. For anti- 
triplet V, there are two more vertices allowed by SU (3)e: this is the vertex of Fig. 11.2(b) 
and similar vertex with lepton instead of antilepton. The same holds true for a scalar 
S. We see that interactions with (anti)triplet bosons exhaust all possible baryon number 
violating interactions, unless new fermions are introduced. 

Further constraints come from the requirement of invariance under SU(2)w and weak 
hypercharge. Let us begin with vector particles V. Since Uz and D{ are antiquark fields, 
there are two combinations of the fields that can enter the interactions of the type (11.34) 
and give rise to the vertex of Fig. 11.2(a): 


ULQ (2. 5), (11.36) 

a 1 

DQ (2. -3). (11.37) 
(We show in parenthesis the representations of SU(2)w — doublet 2 in our case — and 


weak hypercharge of each operator.) The combination of antiquark and lepton or antilepton 
interacting with V must have the same quantum numbers. The doublet under SU(2)w 
combinations are 


QLEY (2.2). LUE (2-3). LDS (2.2). (11.38) 


(Recall that doublet representation of SU(2) coincides with its conjugate.) We see that 
the vector boson V must be a doublet under SU(2)w. There are two possibilities: either 
it has weak hypercharge -4 and has interactions 


VIDO Qr +V'LUL® + h.c., 


or its weak hypercharge is 3 and the interactions are 


WO Op + VİQLES 4 VLD + he, 


where we do not write Lorentz structure and couplings. The corresponding vertices for the 
second case are shown in terms of usual quarks and leptons in Fig. 11.7. 
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Fig. 11.7. Vector boson interactions that violate baryon number. V4/3 and V1/3 denote upper 
and lower components of the weak doublet, 4/3 and 1/3 are electric charges. Quarks and leptons 
are assumed to belong to the first generation; in fact, each vertex may contain arbitrary linear 
combination of quarks and leptons of all generations with quantum numbers shown here. 


Table 11.1. Baryon number violating interactions in terms of the usual quark 
and lepton fields. 


Quantum numbers with respect to 


SU(3)c x SU(2)w x U(1)y Interacts with 
vector Vi 3, 2, -4 DrQr, LÜr 
vector Vz 3, 2, = UrQt, QiEr, LDr 
scalar Sı 3, 3, 2 QrQr, QLL 
scalar S2 3.1, Z QLQL, UrDr, QLL, ÜrËr 
scalar S3 3, 1, š UrUr, DrEr 


Let us now turn to scalars. Now we have to consider interactions with Lorentz structure 
(11.35). The combinations leading to the vertex of type of Fig. 11.2(a) are 


QrQ1 (s. z) (11.39) 


a. (1.3), o: (1.3), or (a3) pep: (a-f) ao 


There is one triplet combination of the type (11.35) involving antiquark and giving rise to 
the vertex of the type of Fig. 11.2(b), 


QLL (3 z), (11.41) 


and three singlet combinations, 


QLL (2.3). U B; (1.3), DL Et (2.3). (11.42) 


Thus, triplet under SU(2)w scalar of weak hypercharge 2 can interact with combina- 
tions (11.39), (11.41), while siglet scalars can interact with appropriate combinations 
from (11.40), (11.42). All these possibilities are listed in Table 11.1; structure of all inter- 
action vertices is the one shown in Fig. 11.8. We see that (B — L) is conserved in the 
absence of new fermions. 

The structure of vertices for all these interactions is shown in Fig. 11.8. Thus, we again 
see that symmetries of the Standard Model ensure (B — L) conservation unless there are 
new fermions. 
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(a) (b) 


Fig. 11.8. Interactions with scalars also conserve (B — L). 


Ar 


Fig. 11.9. Annihilation of two Az. 


Let us now include into the theory a new fermion Az. Let it be a singlet under the 
Standard Model gauge interactions. Color anti-triplet combinations that could interact 
with vectors and lead to vertices shown in Fig 11.5 are 


= 1 x tre 4 T ae 2 
QrAL (2-1) ’ ALU; (1-4) , ALD, (1.3), 


The first of them has quantum numbers (11.37), so that color antitriplet vector boson with 
these quantum numbers can have vertices shown both in Fig. 11.2(a) and in Fig. 11.5. The 
scalar combinations of antiquarks and new fermions are 


aoe 1 : 4 2 2 
QrAL (2-3). Ur Ar (2-4) DiAz (2.3). 


The second and third of them match one of the combinations in (11.40), so there are two 
types of scalars that have vertices shown both in Fig. 11.8(a) and in Fig. 11.5. 

The neutral fermion Az may have Majorana mass; see Appendix C. If it is stable, it 
is cosmologically allowed and for appropriate values of parameters it can be dark matter 
candidate, provided it pair annihilates via, say, exchange of a new neutral scalar X, see 
Fig. 11.9, and the annihilation cross-section is sufficiently large.’ On the other hand, if 
baryon asymmetry is generated in the processes shown in Figs. 11.2 and 11.5, then there 
should be practically no interactions with leptons and Englert—Brout—Higgs doublets like 


ha HALL + he. (11.43) 


These would give rise to the vertex of Fig. 11.10 that violates lepton number and (B — L), 
and hence in combination with electroweak processes would wash out baryon asymmetry. 


T The existence of these processes means that the lepton number of Az is naturally set equal to zero. 
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(a) (b) 
Fig. 11.10. Lepton number violation in interactions with the Higgs boson. 


Problem 11.8. Assuming that masses of AL- and X-particles are small compared to 
masses of V- and S-bosons participating in baryon number violating processes, and that at 
energies below these masses A, -particles participate only in interactions shown in Fig. 11.9, 
find cosmological bounds on masses ma, Ms and Yukawa couplings. Consider the cases 
ma > Ms and Mma K Ms separately. At what values of these parameters is the fermion 
A, a dark matter candidate? 


Problem 11.9. Let the fermion A, have Majorana mass ma >> mn, where mp, is the 
mass of the Standard Model Higgs boson. 


(1) At what values of the coupling ha in (11.43) is the interaction of Fig. 11.10 irrelevant 
for cosmology? Assume that the annihilation processes of Fig. 11.9 lead to negligible 
number of A-particles at low temperatures. 

(2) The interactions (11.43) contribute to masses of the conventional neutrinos via see- 
saw mechanism (see Appendix C). Using the previous result, find bounds on these 
contributions. 


Problem 11.10. Let the interactions of Fig. 11.9 be absent, while interactions of 
Fig. 11.10 be non-negligible. Let the baryon asymmetry be generated at T > ma (say, 
in decays of super-heavy scalar bosons S — qq and S — GAz; see Figs. 11.5 and 11.8). 
Assuming that fermions Ar have large Majorana mass ma >> mp, find bounds on ma 
and h, from the following requirements: (a) baryon asymmetry must be preserved until 
the present epoch; (b) fermions A; must have decayed long before BBN epoch. Estimate 
in this case the contribution of interactions (11.43) to Majorana masses of conventional 
neutrinos. 


Let us now turn to another case mentioned above, namely, that the neutral fermion Az 
has lepton number +1, while lepton number is conserved below Maur modulo electroweak 
effects, and new lepton has short enough lifetime. The minimal possibility is that A; is 
the left component of a Dirac fermion A. Then the Standard Model Lagrangian can be 
extended by adding terms (we do not write kinetic term for A), 


M, AA +h, ARL + h.c. 


If the mass of A is large compared to the mass of the Higgs boson, Ma > mp, then the 
main decay channels of the lepton A are A > h°v, A —> h*I*, while for Ma < mp its 
decay proceeds via the Higgs boson exchange. Heavy leptons of this type are cosmologically 
acceptable. 
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Problem 11.11. Are there massless neutrinos in the latter extension of the Standard 
Model? 


11.2.3. Violation of lepton numbers and Majorana 
masses of neutrino 


As we discuss in Appendix C, one of the plausible ways to explain small neutrino 
masses is the see-saw mechanism [203-206]. One adds new fields N@, heavy sterile 
neutrinos, which we treat as left fermions. These fields are neutral with respect 
to the gauge interactions of the Standard Model, hence the name “sterile”. The 
superscript œ labels the species of these new fields; in what follows we assume, 
although this is not completely necessary, that a = 1, 2,3, i.e., there are three new 
fields, according to the number of Standard Model generations. The fields N? have 
Majorana masses and interact with the Standard Model fields, so additional terms 
in the Lagrangian are 
M, 


L= — NaNe — yo N aH Lg + h.c., (11.44) 


where La are left leptonic doublets of the Standard Model, H is related to the 
Standard Model field H by (see Appendix B) H, = cij H7 (i,j are the indices of the 
doublet representation of SU(2) yw), summation over a, 3 is assumed. The Yukawa 
couplings Yag are in general complex while the masses Ma are real. The second 
term in (11.44) is the only interaction of N,q consistent with the Standard Model 
gauge symmetries, unless other new fields are introduced. 

Neglecting the second term in (11.44) (this is an excellent approximation in 
the present context) one observes that the fields Ne describe three fermions of 
masses Ma. We will assume that Ma > v; this range of masses is indeed preferred 
from the viewpoint of baryogenesis. The Yukawa interaction in (11.44) leads to 
the instability of N-particles. We will be interested in high temperature situation, 
when the Englert-Brout—Higgs expectation value is zero (see Chapter 10). Then 
the doublet H describes one electrically neutral and one electrically charged scalar 
particle plus their antiparticles. Assuming that Ma > mp, the Yukawa interaction 
explicitly written in (11.44) gives rise to the decay (Fig. 11.11(a)), 


Na — hlg, (11.45) 


where lg is a charged lepton or neutrino of generation 3, while h is one of the Higgs 
particles. The Hermitean conjugate interaction term gives rise to the decay into 
antilepton in the final state? (Fig. 11.11(b)) 


Na > hlg. (11.46) 
8The scalar particles in (11.45) and (11.46) are actually different: say, in the case of charged lepton 


h = ht and h = h™ in (11.45) and (11.46), respectively. This will be unimportant for what follows, 
and we will use somewhat vague notation h. 


334 Generation of Baryon Asymmetry 


(a) (b) 


Fig. 11.11. Decays of heavy neutrino. 


le 


Fig. 11.12. Lepton—Higgs scattering with lepton number violation. 


Clearly, the existence of both of these processes means that lepton numbers are 
not conserved, whatever lepton numbers are assigned to Na. The same conclusion 
follows from the existence of a process (Fig. 11.12) 


hla > hla, 


as well as neutrinoless double-8 decay. It is useful for what follows to estimate the 
decay width, 


y? 
Tn, ~ =Ma. (11.47) 
8T 


More precisely, the total decay width is given by (assuming Ma > v) 
Py, -y Weel ng 
Na 3, Mo 
B 
Let us stress that in the tree level approximation, partial widths of the decays (11.45) 
and (11.46) are equal to each other, 
T'(Na — hig) =T(Na —> hig). (11.48) 


This is not the case once loop corrections are included. This is very important from 
the viewpoint of the baryon asymmetry generation; see discussion in Sec. 11.4. 
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Problem 11.12. Consider a theory of one fermion field N, (the subscript L will 
be omitted in this and next problem) whose Lagrangian is 


= M =- 
L =i N43 N + NN. 


Obtain the field equation and its positive-energy solutions for |p| < M and |p| > M, 
where p is spatial momentum. Wave functions of which particles do they describe? 


Problem 11.13. Quantize the model of the previous problem. Upon adding the 
Yukawa term (yN°wy + h.c.), where y and y are massless left fermion and scalar, 
respectively, find the decay widths of N — wy and N — we* at the tree level, thus 
confirming (11.47) and (11.48). 


11.3. Asymmetry Generation in Particle Decays 


Theories extending the Standard Model sometimes provide a simple mechanism 
of the baryon asymmetry generation due to particle decays. As we discussed in 
Secs. 11.2.2 and 11.2.3, these theories may contain new particles whose decays vio- 
late baryon and/or lepton numbers. In fact, what is important is (B — L)-violation: 
these decays occur before the electroweak transition, so the electroweak processes 
discussed in Sec. 11.2.1 make the baryon asymmetry equal? to the generated (B—L) 
asymmetry up to a numerical factor of order 1. 

As we discussed in Sec. 11.2.2, scalar bosons of GUTs, S (or vector bosons V), 
may have decay channels 


(1): 3 99 


bie Goat (11.49) 


where q denotes conventional quarks and A is a new fermion neutral with respect to 
SU(3). x SU (2)w x U(1)y. To simplify formulas below we assume (though this is 
not necessary) that the decay S — ql has negligible partial width. Lepton number 
of A is either 0 or +1; we will take La = 0 for definiteness. The values of (B — L) 
of the final states of the first and second type are, respectively, 


2 1 


The boson S is color antitriplet, so there exists its antiparticle, triplet S. Its decay 
channels are 


(11.51) 


°The possibility that the baryon asymmetry is generated in the electroweak processes themselves 
is studied in Sec. 11.5. 
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and (B — L) of final states are given by 
2 1 
(B-L)qy=- 5 and (B-L)a)= 5. (11.52) 


Suppose that at temperatures exceeding the S-boson mass ms, bosons S and S$ 
are in thermal equilibrium. Then their number densities, modulo color and spin 
factors, are the same as those of other particles. As the Universe cools down, thermal 
equilibrium breaks down: S- and S-bosons decay, while the inverse processes of their 
production do not occur. These decays produce (B — L)-asymmetry, provided that 
the probabilities of decays (1) and (2) are not the same as probabilities of (1) and 
(2), respectively. Let us denote partial widths of decays (1), (2), (1) and (2) by Pia), 
Ta), Ta and Ta. (B — L)-asymmetry produced in decays of one S-boson and one 


S-boson is 
1 2 1 2 1 
ô = — || -Traj -—-T —{ -ro-ro 
cm G (1) 3 o) € (1) 3 «)}, 


where Trot is the total S-boson width. The latter equals the total S boson width 
due to CPT-theorem, 


Pie = Tay + ley =F ay ley: 


(We assume for simplicity that there are no other channels.) Making use of the 
latter equality, we obtain for “microscopic asymmetry” 


_, ae Li 


o] 
Tiat 


(11.53) 
The difference between partial widths of the decays (1) and (1) is possible only if 
C and CP are violated; this is the way the second condition of Sec. 11.1 shows up. 

Partial widths of a particle and antiparticle coincide at the tree level, e.g. (see 
Fig. 11.8(a)), 


Pes = qq) = res — qq). (11.54) 


Indeed, modulo one and the same kinematic factor, the tree level probabilities are 
equal to the modulus squared of the coupling gi) in the interaction vertex, which is 
the same for particle and antiparticle. However, the equality (11.54) does not hold, 
generally speaking, at one loop level (cf. Sec. B.4). To obtain nonzero microscopic 
asymmetry (11.53), it is sufficient to have other bosons 5S” with the same quantum 
numbers as S, with the couplings to all these bosons being complex. Note that 
complex couplings mean C'P-violation in models we consider here. At the one loop 
level, the amplitude of decay S — qq is given then by the sum of the diagrams 
shown in Fig. 11.13. (There are other diagrams, but considering these is sufficient 
for the sake of argument.) 
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+ 
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Fig. 11.13. Diagrams for S — qq decay at the one loop level. Bosons S’ have the same quantum 
numbers as S. Shown here are couplings in interaction vertices. 


Modulo common kinematical factor, partial width T) is, at the one loop level, 


Ta) = const - lga) + Dgogenganh 


where D is the one loop Feynman integral for the diagram of Fig. 11.13, and sum- 
mation over all S’-bosons is assumed. The analogous amplitude for antiparticle 
contains complex conjugate couplings, therefore 


Ta = const - |g) + D9(2) 92) 9(1|- 
As a result, the microscopic asymmetry ist? 


Ima 992909) 


ô = —2Im(D 
Co aaa eaP 


(11.55) 
Crudely speaking, this is proportional to coupling constant squared, while Im(D) 
contains small loop factor. Assuming that the phases of couplings g) and gg) are 
neither small nor correlated, we obtain crude estimate, 


ss 2) (4), (11.56) 


An Meg! 


> mg and decays 


~N 


where the function f of mass ratio m;/ms is of order 1 at ms 
as Ms/Ms decreases (fermion masses have been neglected). 


Problem 11.14. Find the asymmetry 6 in the above case assuming that S and 
S’ are scalars. Find its dependence on the ratio of masses of S- and S'- bosons 
assuming Ms Sms (but not necessarily ms & Ms). 


Let us now turn to cosmological generation of (B — L)-asymmetry. The simplest 
case is when S- and S-particles are in thermal equilibrium at T >> ms, while at 
T Smeg their decays are the dominant processes. Both of these properties are non- 
trivial. The first implies that the temperature in the Universe was indeed as high as 
T > msg, and, furthermore, that production and annihilation of S'S-pairs is fast at 
these temperatures. The second is valid only if pair creation and annihilation switch 


10Note that in a theory with one S-boson one would get 6 = 0 at the one loop level. Indeed, in 
that case one should set g(17) = g(1) and g(x) = g(2) in (11.55). 
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off at T < ms, and if production of S and S in processes inverse to (11.49) and 
(11.51) (inverse decays) is negligible at those temperatures. The latter requirement 
is indeed satisfied when the decay width of S-particles is small compared to the 
expansion rate at T ~ ms, 


2 


Tiot S$ H(T ~ ms) = = . (11.57) 
Pl 


If all these conditions are fulfilled, then at T = ms the number density of S-particles 
and their antiparticles is the same as of all other species, ns ~ T°, i.e., 


Ns 1 


S. ge 
where s is the entropy density. The decays of S and § at later times produce the 
asymmetry 


Bg ct Ri (11.58) 


Clearly, the asymmetry is quite large in this simplest case: the required value 
Aps- ~ 1071? is obtained for 6 ~ 1078 (assuming that the number of degrees 
of freedom g, at T ~ mg is similar to its value gSM ~ 100 at T ~ ms). 

The requirement (11.57) implies, in fact, that the S-particle mass is very large. 
Indeed, we have an estimate 


g? 
Pitot ~ —Mms. (11.59) 
4r 


Then the requirement (11.57) gives 


g 
ms Z {-Mbi, (11.60) 


As an example, for g?/47 ~ 107? one obtains ms = 101° GeV. The possibility that 
the Universe had such a large temperature is rather problematic (see accompanying 
book). For smaller couplings the requirement (11.60) is easier to fulfill, but the 
mass of S-particles must be large in this case too. Indeed, without fine-tuning of 
parameters the estimate for the microscopic asymmetry is given by (11.56), i.e., 
ô < g? /4r. Making use of (11.58) one obtains the value A,_, ~ 1071 for g?/4r = 
10-8, i.e., 


Ms 2 101° GeV. 


The general statement is that the asymmetry generation in particle decays can occur 
only for large masses and hence at high temperatures. 

We note that the departure from thermal equilibrium (the third necessary con- 
dition of Sec. 11.1) is achieved rather trivially in the scenario just described: at 
T S ms decay processes occur while inverse decays do not. 
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Let us now consider the case in which the inequality (11.57) is not valid. Let us 
introduce the parameter 


(11.61) 


and let us take it large, 
kK > 1. 


The dependence on this parameter is not particularly strong, so in this case the 
generation of the required (B — L)-asymmetry is also possible. Again, the mass ms 
must be quite large. Unlike in the previous case, we now need not assume that S' and 
S were in thermal equilibrium at T >> ms; furthermore, the maximum temperature 
in the Universe may be even somewhat smaller than ms. 

We will see in the end of this Section that at K >> 1 the estimate for (B — L)- 
asymmetry is 


Aps- = const - (11.62) 


gx K log K’ 
where constant is of order 1. Similar to the small width case (11.57) this estimate 
implies that S-particle mass is large. If all relevant couplings are of the same order, 
then the microscopic asymmetry is again estimated by (11.56), i.e., ô S L, while 
the estimate for the width is given by (11.57). Making use of the definition (11.61) 
we find, modulo logarithm, 


ms Z As-z eM py, 
i.e., we again obtain 
Ms > 10°" GeV. 


This bound is weaker in models where couplings differ by orders of magnitude (e.g., 
couplings gq) and ggv) in Fig. 11.13 are considerably greater than gi) and g(2)), 
but in any case the mass of S-particles must be large. 

To summarize, heavy particle decays provide efficient mechanism of generation 
of (B — L) asymmetry and hence baryon asymmetry. This mechanism may work in 
(B — L)-violating GUTs. It may work also in supersymmetric GUTs, and in that 
case S-particles may be not only bosons but also fermions. Some (but not fatal) 
problem with GUTs is that their mass scale is extremely high (see (11.29)), so the 
maximum temperature in the Universe could be lower than this scale (this is indeed 
the case in most inflationary models). 


Let us obtain the estimate (11.62). Let us assume for the time being that at T < ms 
the dominant processes are decays and inverse decays of S and S. One might think that 
inverse decays of the type 


qq—> S (11.63) 
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should be very rare, since the center-of-mass energy of the two incoming particles must 
be adjusted to the mass of S-particle with precision of order of its width Pot. Still, the 
probability of inverse decays is not small in thermal equilibrium, as the rates of direct 
and inverse processes are equal. Hence, for quarks and leptons in thermal equilibrium the 
number of inverse decays per unit time in comoving volume is 


d(na? inv. e 
doa =Po ni -a?, (11.64) 
where n% is the equilibrium number density of S-particles. 


Problem 11.15. Let us neglect cosmological expansion. Show by explicit calculation that 
the rate of the inverse processes (11.63) is 


dn a 
(2) =I(S >q) ns", 


where T(S — qq) is the partial width of the decay S — qq. Simplifications: Consider low 
temperatures T < ms; do not account for color of quarks and S'-particles. 


S-particle decays give the following contribution to the evolution of their number in 
comoving volume, 


— —Tiop- ns - a’. (11.65) 


By adding (11.64) and (11.65) we obtain the Boltzmann equation for the number density, 


3 
Ansa) Sofie ten =n a) (11.66) 
Let us now consider (B—L) density in comoving volume ns- za”. Without loss of generality 
we assign (B — L) = 0 to S- and S-particles. The Boltzmann equation for (B — L) density 
in comoving volume reads 


ane 
ca a ô -Trot (ng a? — n4t - a) — Trot nii - a? - iat : (11.67) 
dt nq 


The first term in the right-hand side of Eq. (11.67) describes the asymmetry production in 
decays of S- and S-particles, while the second and third terms wash out the asymmetry. 
The second term is induced by C'P-violation in the inverse decay processes. It is non-trivial 
even if (B — L) is zero in the medium, and its form is given by the requirement to cancel 
the first term in termal equilibrium.'! The third term is non-trivial even in the absence of 
the microscopic C'P-violation. It has to do with the following dynamics: if there is (B — L) 
in the medium, it gets washed out due to production of S-particles. Indeed, if the medium 
contains more quarks than antiquarks, then there are more inverse decays qq — S than 


11We simplify the situation slightly: the second term also includes the contribution of resonant 


scattering of fermions, e.g., qq > S — qA. Still, the term is correct, since it follows from the 


requirement that (B — L) is not generated in thermal equilibrium when ns = n4. 
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the conjugate processes gg — S, which reduces the asymmetry. Evolution of (B — L) due 
to the latter mechanism is determined by the equation 


d(ns—1 * a?) 
dt 


where we neglected the CP-asymmetry (it would give the second term in right-handside 
of Eq. (11.67)). To the leading order in asymmetry, ng- K nq, one obtains, 


2 2_4 eq 
Ng — Ng = F NB-L* Ng 3 
3 
where we recall ng — ng = (2/3)ns-z. Since in the termal equilibrium the rates of decays 


and the inverse decays coincide, then 


2.73 2 3 3 (2 2 
= —(Oqqs) + Ng a + (Fgq_43) Ng a = —(Oqq—s) +a - (ng — Ng), 


eq _ eq2 
Prot nS = (Faqs) NG» 


hence 
eq _ 
(Oqq—s) “Ng = Prot: eq? 


and we arrive at the third term in Eq. (11.67) with parameter c = 4/3 ~ 1 accounting for 
the number of decay channels. 
It is convenient to make use of Eq. (11.66) and write 


d(ng— a") d(nsa?) 3 NB-L 
= 5 a tot nf a 11.68 
dt dt e A E a) 
The system of equations (11.66) and (11.68) determines number density of S-particles and 
(B — L) asymmetry at all times if they are known at initial time t;. 
It is convenient to introduce the variables 
Ns 


Nes and Nep E 


=a m (11.69) 


so that nsa? = const - Ns, ng- a? = const - Ng- with one and the same constant. Also, 
let us use, instead of time, the variable 


pms 
ET 
and make use of the relation (3.34) 
T T? 


Recalling that ng x T? we obtain the following form of Eqs. (11.66) and (11.68) 
dNs 


= —Kz(Ns — Né?), (11.70) 
dz 
dNg_—rt _ dNs Po eq 
dz =p: as — Kz. N; - NB-, (11.71) 


where the parameter K is defined in (11.61) and 


We are interested in the case K >> 1, when the relevant temperatures are low, T < ms, 
i.e., z >> 1. The equilibrium number density of S-particles in this regime is (spin and color 
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factors are omitted) 


i.e., modulo irrelevant constant ĉ, we have 
NSS = 623/27 e7*, (11.72) 


Hence, we have explicit expressions for all quantities entering (11.70), (11.71). 
Let us begin with Eq. (11.70). Its solution is 


Ns(z) = f ae OKZ NS (z')dz' F e7 26?-P Ne(zi), 


i 


where z; = = is the initial value of the variable z, at which the initial value of the 
S-particle relative number density is Ns(z:). This solution shows that for K >> 1 the 
initial value of the number density gets irrelevant soon (the second term rapidly tends to 
zero). Now, at K >> 1 the integral is saturated at z’ near z (if N59 (z) is not very small), 
therefore 


Ns(z) = N&?4+0 (z): (11.73) 


Both properties are quite obvious: large K means high rates of decays and inverse decays, 
so S-particles are close to thermal equilibrium. 


Let us turn to Eq. (11.71). Neglecting corrections of order K~* and making use of 
(11.73), we write it in the following form, 


dNg-t 
dz 


dN‘! 


=: — Kz-NS!- Ngor. (11.74) 
Departure from thermal equilibrium that leads to the asymmetry generation is now rather 
subtle: even though the density of S-particles is close to equilibrium at each moment of 


time, it changes in time (and hence in z), so the equilibrium is incomplete. 
The solution to Eq. (11.74) is 


“opan dN! —1( 23,2 
Neue) = 8: f e 1 7) 4e Teiz) Ng- r(zi), (11.75) 


where 


$ 


iade] NS! (z")Kz"dz". 


We see that initial anisotropy is irrelevant at late times, just like the initial S-particle 
density, provided that the initial temperature is of order of ms or only slightly less than 
ms (so that N5? (zi) is not very small). 


Problem 11.16. Let the maximum temperature in the Universe be Ti, and K > 1. Esti- 
mate the maximum value of T; such that most of the initial (B—L)-asymmetry is preserved 
up to the present epoch. 
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Assuming that the initial temperature is sufficiently high, we neglect the second term 
in (11.75) and write for the resulting asymmetry 


co E eq 
eae] o N (11.76) 
0 dz” 


where 


T(z) =I(z,00) = L NS (z')Kz'dz'. 


The integral in (11.76) is saturated at rather large z (i.e., at temperatures well below ms): 
the asymmetry produced at z ~ 1 (i.e., T ~ ms) due to the first term in (11.74) is washed 
out in the course of evolution. We find from (11.72) that at large z 


eq 
As = ê? eT? = NE (2). (11.77) 
Z 


Hence, the integrand in (11.76) is the product of two exponential factors: the decaying 
factor e~* from (11.77) (the decreasing density of S-particles makes the generation of 


(B — L) slower) and increasing factor etl) (wash-out becomes less efficient). Such an 
integral is determined by the behavior of the exponent 


f(z) = Îl) +z. 
This function has a minimum at z = z« such that 
NSI (24) Kz. = 1. (11.78) 
We recall (11.77) and obtain’? (see Sec. 6.1 regarding equations of the latter type) 
zx = log K + O(log log K) = log K + O(log log K). (11.79) 


Both the integral [ (z4) and its first and second derivatives are of order one at the saddle 
point, so 


f (ze) = 2 +00), 
and 

Pf 

7a O(1). 


The latter property means that the integral (11.76) is saturated in the region around 2. 
whose size is 


Azn lz. 
We obtain in the saddle point approximation 
Nsg-z = ô- const - N5 (z4), 


and finally, making use of (11.78), we find 


ô 
Nsg- = const - —— = const - 


ô 
Ka Rog iia 


with constant of order 1. This is precisely the quoted result (11.62). 


12The approximation z. = log K works poorly at moderate K but this is unimportant for our 
purposes. 
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Let us now discuss the result obtained and calculation performed. First, the tempera- 
ture of the asymmetry generation, T+, is determined by (11.79), i.e., 


ms 


Wash-out effect is not very strong at this temperature (Î(z+) ~ 1), while the density of 
S-particles and the number of their decays are still sizeable, 


aNs! 1 


NSS ~ ——. 
7 dz K log K 


It is these properties that make the resulting asymmetry suppressed rather mildly at 
large K. Second, the result (11.80) is valid provided that the maximum temperature in 
the Universe exceeded T, which is somewhat lower than ms. Third, the asymmetry is 
generated in the temperature interval which is small compared to temperature itself, 


AT Az 1 
Te 2 logK` 


Finally, let us point out that the equation we actually used, Eq. (11.74), does not con- 
tain information on the processes that keep the number density of S-particles in thermal 
equilibrium. Instead of decays and inverse decays (or together with them) these may be 
processes of pair creation and annihilation of S and S. It is this situation that happens 
for colored S and § which we considered in Sec. 11.2.2. Still, the result (11.62) remains 
valid for K >> 1. What we have not included into our analysis is non-resonant scattering 
of the type qq —> qA occurring via S-boson exchange. The latter process tends to wash out 
(B — L) and its effect is sometimes important; see Sec. 11.4. 


To end this Section, let us briefly discuss the possibility that baryon asymmetry 
and dark matter are generated by one and the same mechanism. An example is 
decays of hypothetical particles in the early Universe into quarks and dark matter 
particles. This mechanism is known as hylogenesis [217]; various models are consid- 
ered in the review [218]. The idea is that dark matter particles carry negative baryon 
number, and matter in the Universe is neutral with respect to the baryon number. 
Decays of new heavy particles produce the asymmetry in both visible and dark 
sectors. The model has to be constructed in such a way that the lightest particles 
in both sectors are stable. 

The appealing feature of this scenario is that the relic abundances of baryons and 
dark matter particles are of the same order, ng ~ npm, so the relation Qg ~ Opm 
is naturally obtained provided the masses of the stable particles are also of the 
same order, mg ~ mpm. In the simplest versions one has ng = qpmnpm, where 
dpm ~ 1 is the baryonic charge of dark matter particle. Then the dark metter 
particle mass is mpm = MpqoMQpm/QB % 5qpm GeV. These particles may be 
produced at the LHC, and not necessarily in pairs (as opposed to WIMPs). They can 
annihilate with protons in galaxies and give signals in cosmic rays. Standard indirect 
search for these particles is not particularly promising, but their annihilation in an 
underground detector would yield similar signal as proton decay. Search for these 
processes is underway in, e.g., Super-K experiment. 
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As we discussed in Sec. 11.2.3, violation of lepton number, and hence (B — L) 
may be related to nonzero neutrino masses. Hence, it is appealing to explain the 
baryon asymmetry within the same approach that deals with neutrino masses. The 
lepton asymmetry may indeed be generated in decays of N-particles considered in 
Sec. 11.2.3. This asymmetry is then partially reprocessed into baryon asymmetry 
by electroweak processes of Sec. 11.2.1. This scenario [207] is known as leptogenesis. 

The mechanism of the lepton asymmetry generation is basically the same as 
in Sec. 11.3. The only difference is that S-particles are now replaced by Majorana 
fermions Na. The simplest possibility is that the lepton asymmetry is produced in 
decays of the lightest of these particles, otherwise one should worry about wash-out 
of the asymmetry generated in decays of heavier particles by processes involving 
the lightest ones.!3 Let Nj; be the lightest of N-particles. Then the relevant decays 
are 


Nı = lh, (11.82) 
and 
Nı > Íh. (11.83) 


As we discussed in Sec. 11.3, the microscopic asymmetry is generated if their par- 
tial widths are different. This occurs at one loop, if couplings in the Lagrangian 
(11.44) are complex, and hence CP is violated. The relevant diagrams are shown in 
Fig. 11.14. Thus, one loop partial width of decay N; — Lh is given by 


2 


M 
T(N; = lh) = const - X` Jya + D (2) ` YŽgYyaYy6| (11.84) 
a Byy 7 


where M} is mass of particle N (we neglect masses of the Standard Model particles: 
we will see that leptogenesis requires Ma >> 100 GeV), D(Mı/M,) is the sum of 
loop integrals shown in Fig. 11.14. 


Fig. 11.14. Amplitude of the decay Ny, — lah is the sum of tree-level and one loop diagrams. 
Summation over generation indices @ and y is assumed in the latter. Yukawa couplings entering 
the vertices are shown explicitly. 


13Generation of the asymmetry in decays of the next-to-lightest N-particle has been considered, 
e.g., in Ref. [214]. 
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The partial width of Nı — Ih decay is obtained from (11.84) by replacing the 
Yukawa couplings by their complex conjugates, yag > ys gp: We denote 


Mı 1 M, 
Im D | — | = — = 
ii (3E) aa! (r) 


and find the microscopic lepton asymmetry, 


6= 
Dot 
1 M I atar 
=--— yi (2 Ima Natya) (11.85) 
8T a My Solya 


We used here the fact that Im(y1ayža) = 0 for y = 1, so the diagrams with N; in 
the loop do not contribute. We will concentrate on the mass hierarchy Mı < M2.3, 


when 
M, 3 Mı 
Se) a, 11. 
f (3) 2M, en 


So, the microscopic asymmetry becomes 
3M, 1 1 
ô = — =, I a * Yg ||. 11.87 
16r X, [yia]? 3 m n Yip (sia M, va) ( ) 


Problem 11.17. Show by calculating Feynman diagrams that (11.86) is indeed 
valid for My > Mı. 


Let us make the following comment regarding the expressions (11.85) and 
(11.87). They contain combinations of the Yukawa couplings which are different 
from the combinations entering neutrino mass matrix (see (C.69)), 


v? 1 
Mop = -7 Litho age 8 (11.88) 
Y 


As an example, the unitary transformation of the form 
y > yU (11.89) 


changes the neutrino mass matrix mg but leaves intact the expressions (11.85) and 
(11.86). This is not surprising: the transformation (11.89) corresponds to the change 
of basis for lepton fields la, while the asymmetry is insensitive to this basis. (We 
neglect masses of the Standard Model leptons, so all choices of the basis are equiva- 
lent.) This shows that, generally speaking, the asymmetry ô is not directly related to 
the parameters of the Pontecorvo-Maki-Nakagawa-Sakata (PMNS) matrix which 
describes mixing between conventional neutrinos and responsible for neutrino oscil- 
lations. Therefore, measurements of neutrino oscillation parameters does not tell 
whether there is asymmetry in N-particle decays. Still, the very fact that oscillations 


11.4. Baryon Asymmetry and Neutrino Masses: Leptogenesis 347 


exist suggests that the matrix of Yukawa couplings Yag has non-trivial structure. 
Additional hint towards asymmetry in N-particle decays would be given by the 
observation of C'P-violation in neutrino oscillations: it would show that elements 
of neutrino mass matrix, and hence Yukawa couplings, are complex at least in the 
gauge basis. 

We will get back to the expression (11.87) for the microscopic asymmetry, and 
now we turn to the cosmological lepton asymmetry generation in decays of particles 
N,. The analysis repeats the treatment in Sec. 11.3 word by word, so we simply use 
the results of that Section. The generation is most efficient for 


Trot(Mi) < H(T = M1), (11.90) 


but one has to assume in that case that particles N; are produced in cosmic plasma 
at T >> Mı by interactions other than Yukawa interactions (11.44). We recall that 


and that H = T?/Mj,, and find that the relation (11.90) can be written as 


Mmi K ——-v*~ 107? eV, (11.91) 
Pl 
where 
iu => Mial (11.92) 
— 2M, 


is the sum of absolute values of the Nı-particle contribution into neutrino mass 
matrix (see Section (C.4)). We see that the case under study requires strong hierar- 
chy of Yukawa couplings:!4 if all yag were of the same order, then the contributions 
of the lightest particle Nı into mass matrix (11.88) would be the largest, and all 
neutrinos would have masses below 107° eV, in contradiction to experiment (see the 
discussion in Appendix C before Eq. (C.57)). Still, let us continue with this case and 
obtain the estimate for the mass Mj, assuming for definiteness the direct neutrino 
mass hierarchy (C.59) with small mass of the lightest mass eigenstate. Once the 
inequality (11.90) holds, the estimate for the lepton (and hence baryon) asymmetry 
is (see (11.58)) 


ô 
Ba 
gx 


14This possibility is not particularly natural, but there is no reason to discard it; recall that there 
is strong hierarchy of the Yukawa couplings of quarks and leptons in the Standard Model. 
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Now, the formula (11.87) can be written as 


3M, ( My 
ô= I a ane 11.93 
~ 82 Ya lyra? me P 2 i (r u demas 
where 
2 
me = v 
Ma Ya YyB (11.94) 
Bo ENIM T 


is the contribution of N-particle into neutrino mass matrix. Making use of (C.59) 
we find 

5< 3 Mı 
X“ 8rv? 
The asymmetry A, ~ 1071? is obtained for 6 ~ 1078 (assuming that g, ~ 100 like 
in the Standard Model), so 


Matm- 


M, = 10° GeV. 


Without fine tuning of parameters this gives the minimum mass scale of N-particles 
in the leptogenesis scenario; the maximum temperature in the Universe must exceed 
10° GeV. 

Probably the most natural possibility is that there is direct neutrino mass hierar- 
chy (C.59) related to inverse mass hierarchy of Na, so that m3 œ M;', me x M;", 
mı x M3 with Mı < Mz X M3. In that case the right-hand side of (11.92) is 
estimated as 


mi ~ Matm ~ 0.05 eV, (11.95) 


so that the inequality (11.91) and hence (11.90) do not hold. Then the generation 
of lepton asymmetry is suppressed. Let us use the estimate (11.62) and write 


ô 
Ar S t- ——— 
1, & cons aE leak’ 
where the constant is of order 1, and 
To n Mt 
ce er (11.96) 


A(T ~ Mı) Arv? 
Using the estimate (11.95) we see that the suppression factor is of order 
K ~ 100, 
so the correct value A, ~ 1071? is obtained for larger microscopic asymmetry 
ô 2 107. (11.97) 


On the other hand, for y = 2,3 the right-hand side of (11.94) is estimated as 
m ~ Msol, SO (11.93) and (11.97) give 


M, > 10” GeV. 
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Fig. 11.15. Lepton scattering off scalars that washes out the lepton asymmetry. 


We see that even in the least favorable case the required mass scale is not unac- 
ceptably large. We note that in the case we study here, one does not need to invoke 
extra mechanisms of N-particle production: they are efficiently produced in inverse 
decays. What is required is that the maximum temperature in the Universe exceeds 
(see (11.81)) 


ro 
~ log K 


Thus, leptogenesis scenario indeed works in a wide range of parameters. Inter- 


estingly, the range of neutrino masses suggested by the oscillation experiments is 


not very far from aroi ~ 107? eV, so that the suppression factor (11.96) is not 
Pl 


very large in any case. In other words, neutrino masses m < 1 eV are such that 
there is crude (within two or three orders of magnitude) equality between the cos- 
mological expansion rate at leptogenesis and the width of the lightest N-particle, 
Trot ~ (1 — 1000) - H(T ~ Mı). This coincidence is a rather strong argument in 
favor of leptogenesis. 

If neutrino masses are much larger than Matm œ 0.05 eV, they must be degen- 
erate. In that case one obtains interesting bounds from the non-resonant scattering 
shown in Fig. 11.15, 


lh — th, (11.98) 


and crossing processes. These tend to wash out the lepton asymmetry. The new 
contribution to the Boltzmann equation for lepton number density, as compared to 
Sec. 11.3, is 


73 
d(nz - a") x Pin: (nz a3), 


where Ta is the rate of processes of the type (11.98). In terms of the variable 
z = M,/T this contribution reads 


dN, Mz, 
— Tin(z)- Nz, 
dz ~~ Mri?) Nt 
where N; = +4. Wash-out of the lepton number is important after the time at 


which the asymmetry is generated, i.e., at z Z z = log K (see Sec. 11.3). Since 2, 
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is typically large, the N-particles are non-relativistic at z > z, and the cross-sections 
of the processes (11.98) are estimated as 


Olin = const - X 


apy 


2 


YyaYyB 
My 


(Recall that N-particles are fermions; their propagator at low momenta is um) 
Making use of the expression (C.69) for neutrino mass matrix, the cross-section is 
cast into the form 


Tr(mmt) 1 2 
Jin = cons = const -7 X mi, ( ) 


It follows already from dimensional argument that 
Tın = const + oip - T3. 


As a result, we find that at z > z,, when decays and inverse decays are switched 
off, lepton asymmetry obeys 


dN, M} T? E m? M} Mi E m? 
dz = —const - “M, ui . N, = —const - “a 


where the constant is of order 1. We see that the scattering processes suppress the 


-N,, (11.100) 


asymmetry by a factor 


oo t 2 
exp @i da MiMi 27) 


U 
t 2 
= exp (- — - MžıMı - L), (11.101) 


Let us require that this factor is not very small. This gives 
Dms LEN 
pi 

Even for relatively small Mı ~ 10° GeV when z, ~ 1 (the case (11.91)) the resulting 
bound is quite strong (we take into account neutrino degeneracy) 

my S1evV. (11.102) 
In the case Mı > 101? GeV (and z, ~ 10) this bound is even stronger, 

my S 0.1 eV. 
Accurate calculations of the wash-out effect in scattering give [208] 


my < 0.12 eV (11.103) 


for most values of the parameters of the model. This bound does not contradict 
existing experimental and cosmological bounds and again suggests that the neu- 
trino masses are just right for leptogenesis. On the other hand, if double-beta decay 
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experiments would measure neutrino mass above the bound (11.103), the leptogen- 
esis scenario! will be much less attractive. 


Problem 11.18. Discarding experimental bounds on neutrino masses and neutrino 
oscillation data, and assuming that all Yukawa couplings are of the same order, show 
that successful leptogenesis is possible only for 


my S1 eV. 


~N 


There is an alternative to the leptogenesis in decays. Lepton asymmetry may instead be 
generated in oscillations of active and sterile neutrinos in the Universe. This mechanism 
does not require very heavy sterile neutrinos and, in fact, works if their masses are well 
below the electroweak scale. The idea is to redistribute the lepton number, by employing 
C P-violation, between active and sterile neutrinos [219]. (The role of lepton number is 
played in the case of sterile neutrinos by their helicity, which is conserved at high energies.) 
The oscillations do not produce the net lepton number, but part of lepton number of active 
sector is reprocessed into baryon number by sphalerons. 

This mechanism is at work if the sterile neutrinos are not in thermal equilibrium by the 
epoch of the electroweak transition, otherwise the asymmetry would be washed out. The 
production rate of sterile neutrinos Ny in particle collisions is estimated on dimensional 
grounds as Py ~ T|yya|*/(87), cf. (11.47). As usual, the sterile neutrinos enter the thermal 
equilibrium when this rate becomes comparable with the Hubble rate H = T? /Mpi, ie., 
at temperature 


Tx! ~ Mp |yyal”/(87). 
The condition Ty’ < 100 GeV is satisfied for small Yukawa couplings 
lial” K 107". (11.104) 


If sterile neutrinos give masses to active ones via the see-saw mechanism, then, without 
fine tuning of parameters, one has the estimate (C.68), which gives |yya|? ~ Mym, /v?. 
The latter shows that the condition Ty" < 100 GeV holds for sterile neutrino masses below 
the electroweak scale, Mn < 10 GeV. 

The generation of asymmetry requires fast C P-violating processes due to complex 
Yukawa matrix yya. Like in the case of decays, these processes occur at one loop level; see 
Eq. (11.85). So, they are suppressed by the same parameter |yya|? which, by the require- 
ment of inequilibrium (11.104), must be very small. Thus, one needs the enhancement of 
the CP-violation in oscillations. A mechanism for this enhancement is the resonance in 
the coherent system of two sterile neutrinos N1, N2, which should be degenerate in mass, 
AM « Mi,.2 = M. The resonance occurs when the oscillation frquency 


w ~ |M? — M3|/T ~ AM. MJT 


is of order of the expansion rate, w(T) ~ H(T). This gives for the temperature of the 
resonance 


Tres ~ M - (M$, AM/M?)*8, 


At earlier times, when w < H, the resonance does not have time to develop, while later on, 
when w > H, the oscillations are too fast, and the small factor |yyq|? is not compensated 


15We talk here about leptogenesis at the hot Big Bang stage (“thermal leptogenesis” ). The analysis 
in the text does not apply to leptogenesis at reheating which we do not consider here. 
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for. It has been shown that the mechanism is efficient for Tes in the hundred GeV range; 
in that case the asymmetry is large, Az ~ 1 [220], but this requires strong degeneracy in 
sterile neutrino masses, AM/M « 1. 

We note that the generation of lepton asymmetry may happen even after the elec- 
troweak transition. This asymmetry is not reprocessed into baryon asymmetry, but it may 
help to produce sterile neutrinos serving as dark matter candidatesl; see Sec. 7.3. 


11.5. Electroweak Baryogenesis 


It is natural to ask whether electroweak baryon number violation discussed in 
Sec. 11.2.1 can by itself generate baryon asymmetry at temperature of order 
100 GeV. Such a possibility — electroweak baryogenesis — is of particular interest 
because the relevant energy range of 100 GeV — 1 TeV is accessible to collider exper- 
iments. Examining this energy range will either rule out or give strong arguments 
in favor of electroweak baryogenesis in our Universe. 

Both the requirement of strong enough CP-violation and the thermal inequi- 
librium condition are quite non-trivial in this scenario. C'P-violation in the CKM 
matrix of the Standard Model is insufficient for baryogenesis; strong deviation from 
thermal equilibrium is also not inherent in the Standard Model. However, some 
extensions of the Standard Model have both extra sources of C'P-violation and pos- 
sibility of enhancement of thermal inequilibrium at the electroweak transition, so 
the electroweak baryogenesis still remains an option. 


11.5.1. Departure from thermal equilibrium 


The Universe expansion at temperatures of order 100 GeV is pretty slow. The 
Hubble time is 


tya H =— w10" Gey 210-5: 


This is much longer than the time scale of electroweak interactions between particles 
in the medium, 


1 z E 
tint ~ p ~ 1 GeV 110 g: 


Qw 

Hence, one of the necessary conditions of Sec. 11.1, strong deviation from thermal 
equilibrium, is very non-trivial. It appears that the only way this condition can be 
satisfied is that the electroweak phase transition is first order. As we discussed in 
Sec. 10.1, first order phase transition occurs via spontaneous creation of bubbles 
of the new phase, their subsequent growth and percolation. This boiling of matter 
in the Universe is a strongly inequilibrium phenomenon. We will see that baryon 
asymmetry may indeed be generated in this process. 

Even the requirement that the electroweak phase transition is first order is insuf- 
ficient. Medium after the phase transition quickly equilibrates, and the asymmetry 
generated in the course of transition may be washed out. This does not happen only 
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if the electroweak baryon number violation rate is smaller than the expansion rate 
after the transition. As we will now see, the latter requirement is not fulfilled in the 
Standard Model (in fact, there is no phase transition in the Standard Model at all), 
but can be satisfied in some of its extensions. 

Electroweak baryon number violation is switched off in the broken phase after 
the phase transition, if the inequality inverse to (11.22) holds, 


Mw(T) . 0.66 


T ~“ B(ma/Mw) 


Recalling that Mw(T) = 9¢(T)/V2, aw = g?/4r ~ 1/30 and B ~ 2.4, we find that 
the latter inequality gives!® 


(11.105) 


where Te and ®, are the phase transition temperature and the Englert—Brout-— 
Higgs expectation value just after the transition. The requirement (11.105) strongly 
restricts the physics at the electroweak scale. To see this, we make use of the one- 
loop results of Sec. 10.2. In the one loop approximation we have 


where AÀ is the Higgs self-coupling, the parameter y is defined by (10.40), and the 
constant c ranges from 1/2 to 3/4 depending on how delayed the phase transition 
is (see Eqs. (10.44), (10.45)). Making use of the relation between the Higgs boson 
mass and self-coupling, mp = V2Av, we obtain from (11.105) a bound on the Higgs 
boson mass, 


2 2 1 3 

Gf Oe 2, ami, (11.106) 
where the realistic value is c = 1/2. Within the Standard Model with one Englert- 
Brout-Higgs doublet, the right-hand side obtains contributions from W- and Z- 
bosons of masses 80.4 GeV and 91.2 GeV and numbers of spin states g = 6 and 
g = 3, respectively. (The contribution of the Higgs boson itself may be neglected for 
estimates.) Recalling that v = 246 GeV, we obtain from (11.106) that the bound in 
the Standard Model is [209] 


mp < 50 GeV. 


This is not the case in Nature: the mass of the Higgs boson is mp, = 125 GeV. Thus, 
electroweak baryogenesis is impossible within the Standard Model. 


16We recall that the normalization of the field ¢ adopted in Sec. 10 is such that () = v/ v2. 
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In fact, a less restrictive requirement of the first order phase transition is not 
satisfied in the Standard Model as well. We pointed out in Sec. 10.3 that the elec- 
troweak transition is a smooth crossover. The cosmic medium always remains close 
to thermal equilibrium, and baryon asymmetry is not generated at all. 

This problem can be overcome in some extensions of the Standard Model. One 
example is given in Sec. 10.2. Another possibility exists (yet [221]) in supersym- 
metric extension of the Standard Model with light superpartners £ of t-quark [210]. 
Their explicit (“soft”) masses are small, and ¢ acquire masses due to interactions 
with the Englert-Brout—Higgs field. Their couplings with this field are equal to the 
t-quark Yukawa coupling, and hence 


milo) = yo. 


Superpartners ¢ form two complex, color-triplet scalars. They contribute to the 
right-hand side of Eq. (11.106) with g; = 12. Due to this contribution, the inequality 
like (11.106) can hold for mp, = 125 GeV. Other extensions of the Standard Model 
with scalar singlets and/or additional scalar doublets may have similar properties. 


11.5.2. *Thick wall baryogenesis 


As we discussed in Sec. 10.1, bubbles of the new phase spontaneously created at the 
first order phase transition expand to macroscopic sizes before their walls collide. 
Hence, most of the matter in the Universe interacts with the walls, while the regions 
where the walls collide have small volume. (The ratio of the corresponding volumes 
is d/R, where d is wall thickness and R is the bubble size at percolation.) Thus, 
when calculating the baryon asymmetry one neglects processes occurring in wall 
collisions; the baryon asymmetry is generated in interactions of matter with bubble 
walls. Importantly, the walls rapidly move through the medium; their velocity is 
determined by “friction” produced by the medium and in most models is of order 
0.1 of the speed of light. 

In this and next Sections we describe mechanisms of baryon asymmetry genera- 
tion in interactions of cosmic plasma particles with moving bubble wall. Our study 
will be rather sketchy, since we will not take into account a number of effects that 
are of various degree of importance for this quite complicated dynamical process. 
For details see, e.g., Ref. [211]. 

A fairly simple, though not completely realistic mechanism of electroweak baryo- 
genesis works in so called adiabatic regime. Let us assume that the thickness of a 
bubble wall is much larger than the mean free path of particles in cosmic plasma. 
Let us also assume that the wall moves through the plasma sufficiently slowly. Then 
at any given time and everywhere in space (including the region inside the wall), 
the medium is in local thermal equilibrium with respect to fast processes of elastic 
scattering, pair creation and annihilation, etc. On the other hand, the electroweak 
baryon number violating processes are not so fast even in the unbroken phase, as 
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can be seen from (11.19). So, it is natural that the latter processes may be out of 
thermal equilibrium. This may lead to the baryon asymmetry generation. 

We are going to illustrate this mechanism making use of a simplified model with 
the gauge group SU(2),, two Englert-Brout-Higgs doublets Hı and Hz and one 
pair of fermions!” Q, (doublet) and qr (singlet), whose Yukawa interactions are 
similar to those of the Standard Model, 


Lint = hQr Hiqr + h.c. (11.107) 


In what follows we take the coupling hı real (by redefining the scalar doublets 
one can always set the Yukawa coupling with Hə equal to zero and hı real). The 
theory with two or more scalar doublets contains extra, with respect to the Standard 
Model, source of CP-violation in the Englert-Brout-Higgs sector. Both in broken 
phase and in the wall region, the fields Hı and Hə have expectation values 


0 
(1,2) = = (11.108) 


where ¢1,2 are complex functions of the coordinate normal to the wall. One of the 

phases of these complex functions is unphysical, and another is a source of CP- 

violation in the wall region. To see this, we recall that under gauge transformations 
ag 

tas 


with gauge function e the fields transform as follows: 


pı => À Fpi, $2 > À? go. 


Therefore, one of the phases is would-be Nambu-Goldstone field which is “eaten 
up” due to the Englert-Brout-Higgs mechanism and becomes the longitudinal com- 
ponent of massive vector field (Z-boson in the Standard Model). Another phase is a 
physical field. To see how the phases 0; and 62 are expressed through this physical 
field, we consider the terms in the Lagrangian which lead to the mass term of vector 
boson, 


Lm =D, HID, H, + D, HD, Ho, 


where D H1 2 = 0,H1,2 — ig% A? Hi2. (We write the term with an analog of the 
field Z,, only.) Near the vacuum ġı = v1/ V2, ¢2 = v2/V2 the fields have the 


17Tn realistic extensions of the Standard Model the major effect comes from the Yukawa interactions 
of t-quark, and our simplified model is thus adequate. 


356 Generation of Baryon Asymmetry 


following form: 


therefore 


2 1 
Lm = TAT = 5 AR (070.01 + 03002) + 5 [V (3,01)? + 03(062)"]. 
Let us choose the unitary gauge in which the field AS describes physical vector 
boson. To this end, we require that there is no mixing between A and 6; 9. This is 
achieved by imposing the gauge condition v70) +v302 = 0. Then the phases are [223] 


b = 5 
— ? 
v? + 3 
vi 
02 = — 2 2") 
vi + U3 


where 0 = 6; — 02 is the relative phase of the fields ¢; and ¢2, which is the physical 
field. 

In the presence of the background fields (11.108) the quadratic fermion 
Lagrangian, besides the kinetic term, contains the term 


Ly = hig. pie dr + hic, 


where qz is the lower component of the doublet Q_,. 

If there is CP-violation in the scalar sector, then both pı and 6; vary across 
the wall. Since the wall moves, both pı and 6; depend on time at a given point of 
space. Therefore, the fermion Lagrangian depends on time, 


Ly = hgp (tt Og, + hc. (11.109) 


Time-dependent phase 6;(t) leads to CP-violation in the fermion sector and in the 
end to the baryon asymmetry, once Q, and qpr are identified with quarks. 

The simplest way to analyse this mechanism is to perform time-dependent phase 
rotation of the field!® qpr 


01 O gr. (11.110) 


qr =e 


This phase rotation makes the Yukawa coupling 01-independent, but induces extra 
term in the fermion Lagrangian, as the kinetic term gives 


ifr" Oude => ifr" Oude T rP nO. 


18 The phase rotation of Qz is an anomalous symmetry, so if we performed the rotation of left 
quarks rather than right quarks, there would appear 01-dependent term in the effective Lagrangian 
of gauge fields. This would be a different, though equivalent approach. We are not going to use it. 
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The latter term modifies the Hamiltonian as follows, 
H > H—6,Np, (11.111) 


where 
N= fatus 


is the right quark number operator. 

We are assuming that transitions from right to left quark (e.g., qr > Q, +H) are 
fast processes as compared to the rate of time-variation of 64, while the anomalous 
baryon number processes are not. If we neglect the latter, the baryon number density 
vanishes. In the case of the Hamiltonian (11.111) this means that there is chemical 
potential jz, to baryon number, which is the only conserved quantum number in 
our model. Hence, the calculation of the free energy proceeds by replacing the 
Hamiltonian (11.111) with 


H=6,Ne—jis(NetN,), (11.112) 


where (Ng + N,) = B is the baryon number. Hence, the effective chemical potential 
for right quarks equals (4s + 41), while it is equal to 4ps for left quarks. Using the 
result of Problem 5.4, we find the expression for the baryon number, 


As = Ar + As = E [ (un +01) +245], 


where we accounted for two left quark species, hence the factor 2 in the last term. 
We now require A, = 0 and obtain 


ie 
HB = -301 
Let us now turn on the electroweak processes that violate the baryon number. Since 
there is non-vanishing chemical potential us, they generate the baryon number by 
partially washing out 4s. Let us make use of Eqs. (5.53) and (5.54) and write 


dng AF-AB 


a a d i 
dt m 

where AF and AB are the changes of free energy and baryon number at one 
sphaleron process, and Isp, is the rate of the sphaleron processes. For one left 


quark doublet (color triplet) AB = 1 and AF = us : AB = up. Thus, 


dns Hay _ 10 
d foe sy 


Laph: (11.113) 


Here we use the assumption that the sphaleron processes are slow, and hence we 
neglect their effect on the evolution of us. In this way we obtain baryon number 
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density generated by the wall passing through a given region of space, 


1 


= — | Paon (tdt. 11.114 
3T 1 ph(t) ( ) 


Np 
Here I'spn depends on time, since the background values of the scalar fields at a 
given point in space depend on time. As a reasonable approximation we take T sph 
given by (11.19), 


Cpe, 2 eis 
so long as the combination 4/|¢1|? + |¢2|?, which determines the vector boson 
masses, is relatively small, \/|¢1|? + |¢2|? < T, and T's), = 0 after that (see 
Eq. (11.105); the W-boson mass in the two-Higgs doublet model is determined by 
v = /|¢1|? + |¢2|?). We recall the formula for entropy density s = (2r? /45)g4 T’ 
and obtain the estimate 


5 5 2 
TE aig) AG, ag 8 AG (11.115) 
8 gx gx VI + VQ 


where A@ is the change in the relative phase of the scalar fields from the unbroken 
phase to the point where sphaleron transitions switch off. For gx ~ 100, aw œ 1/30 
and vı ~ v2 we find 


n 


—= ~ 1078. Ad, (11.116) 
S 


which is quite acceptable for generating the required asymmetry (11.1). This esti- 
mate is indeed valid in realistic extensions of the Standard Model modulo the numer- 
ical factor in (11.114). The dominant effect there often comes from t-quark. 

The necessary conditions of asymmetry generation are fulfilled here in the 
following way: 


(1) Baryon number is violated because I's), is nonzero. 

(2) The source of CP-violation is the time-dependent phase 0. This is an additional 
source of CP-violation as compared to the Standard Model where CP is vio- 
lated only in the quark mixing matrix. This is a general situation: electroweak 
baryogenesis requires extra source of CP-violation. 

(3) Departure from thermal equilibrium occurs due to the time-dependence of the 
phase 0 and low rate of electroweak baryon number violating processes. 


Problem 11.19. Consider the interaction of the SU(2)w gauge field with a new 


scalar field p described by the Lagrangian 
9? 5 
Z . pv ayra 
L= fp) Have V 


Let the background field f rapidly evolve at the electroweak transition epoch. Esti- 
mate the baryon asymmetry produced in that epoch. 
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To end this Section, let us see that the time-dependence of the relative phase of the Higgs 
fields is indeed possible. Let us consider the scalar potential of the form 


V (Hi, Hə) = Vi (HÍ Hı) + Vo(H3 Hə) 
+ A+[Re(H!H1) — vive cos 2€]? + A- [Im(H} H1) — viv sin 2€]”, 


where the minima of the functions Vi and V2 are at |¢i| = vı and |¢2| = v2, respectively, 
and à+, € are dimensionless parameters. For A+ > 0 the scalar potential has its minimum at 


pı =v, d2=e “vp, 
which corresponds to the vacuum phase 
Ovac = & 


Let the phase transition occur in such a way that both Higgs fields develop the expectation 
values. ¢1 and ¢2 are small at the beginning of the transition, so the relevant part of the 
potential is quadratic in the fields, 


Vers = Vi ess (101?) + Va,erz(IG2l”) — viv2Re($2¢1 - Ae”), (11.117) 
where 
Ae” = Ax cos 2E + ià- sin 2E, 


i.e., 
\- 
tan 2¢ = —tan 2€. 
A+ 


The minimum of the potential (11.117) with respect to the relative phase is at 
Ge= C3 


This phase gives the direction in the space of the Higgs fields along which the fields roll 
in the beginning of the phase transition, i.e., in the wall region near the unbroken phase. 
Hence, the phase 0 changes across the wall from 6; = ¢ (front, near the unbroken phase) 
to vac = € (back, near the broken phase), as required. 


In fact, the above analysis is not quite realistic. The medium in and around the 
domain wall region is strongly inhomogeneous; this is seen from (11.112), where the 
chemical potentials zg and (ug +01) depend on both time and spatial location. An 
important phenomena in inhomogeneous medium are diffusion processes which we 
totally ignored. Diffusion changes the overall picture of the baryon asymmetry gen- 
eration [222, 224]. Nevertheless, an order-of-magnitude estimate (11.115) remains 
valid, although the dependence of the baryon asymmetry on parameters (e.g., par- 
ticle masses) is more complicated. The state-of-art approaches to the analysis of the 
baryon asymmetry generation are described, e.g., in the review [225]. 
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11.5.3. *Thin wall case 


The case when the wall thickness is small compared to the mean free path of particles 
is more realistic, but at the same time more complicated. We will discuss here the 
most important physical processes leading to the thin wall baryogenesis, leaving 
aside the detailed analysis. 

We are going to use the same model as in previous Section, but with different 
values of parameters. As before, we assume that CP-violation occurs due to the 
phase of the scalar field that varies in space and time. We will work in the wall 
rest frame for the time being, so instead of (11.109) we have now the following 
Lagrangian for fermions interacting with the wall, 


Ly = hidzp(z)e®™ an + h.c., (11.118) 


where the coordinate z is normal to the wall. The function p(z) changes from zero 
(as z = —oo, unbroken phase, we consider the wall moving from right to left) to ®, 
(as z = +00, broken phase); according to (11.105) we have 


T. (11.119) 


Let us consider fermion with small Yukawa coupling,!? hı < 1. Then its effective 
mass in the broken phase is also small, ms = hı®e < T. We will make our order- 
of-magnitude estimates for the wall thickness of the order of inverse temperature, 


las Th, (11.120) 


We will also assume (and this is indeed the case) that the wall velocity is small, 
Uw <1. Finally, without loss of generality we set the phase 6(z) equal to zero in 
the broken phase, 0(z) = 0 as z > oo. 

Some fraction of fermions moving towards the wall from the unbroken phase gets 
reflected by the wall. Since the phase @(z) depends on z, the reflection coefficients 
R, and R, for the left fermion and its antiparticle are different. Particles whose 
momenta p, well exceed the inverse wall thickness do not experience the reflection 
(they are well described in the WKB approximation), while particles with lower 
momenta do. Still, the reflection coefficient for the latter is small, R « 1: the 
height of the energy barrier is equal to my, which is small compared to p, for 
typical momenta p, ~ l;,'. On the other hand, particles with p, < my do not 
penetrate through the wall at all. Hence, the important range of momenta is 


my < pz < l3. (11.121) 


In this range, we can use perturbation theory in hı, so that the reflection amplitude 
is of order hi, and reflection probability (amplitude squared) is of order h?. The 


19This is not the case for t-quark, which requires special treatment. 
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asymmetry of reflection is thus roughly estimated as 
R, — R, ~ hiOop, (11.122) 


where cp is determined by the variation of the phase 0 in the wall region. 

It is useful to note at this stage that left fermion after having been reflected 
becomes right fermion; see below. 

If the wall is at rest, the system is in thermal equilibrium, and C P-asymmetric 
flux of left and right particles vanishes. This is because the asymmetries of fluxes 
of reflected and transmitted particles cancel each other. This is no longer the case 
for moving wall: the flux of right particles from the wall to the symmetric phase 
is different from the flux of their antiparticles (or vice versa). The asymmetry is 
proportional to the wall velocity. To estimate this asymmetry, we note that momenta 
Px, Py Of the reflected particles are arbitrary, while p, is in the range (11.121). 
Hence, we have the estimate for the asymmetry in the flux of right particles and 
their antiparticles 


Jn ~ VT’ [Ri — Rr] (11.123) 


perl’? 


where we take into account that right particles are left before reflection. Here T?1,,1 
is the number density of particles of relevant momenta, and the factor Vw accounts 
for the fact that the asymmetry vanishes in the static limit vw — 0. The flux of left 
reflected particles is 


i= =y: (11.124) 


Hence, there is an excess of right particles and deficit of left ones (if Jz > 0) in 
unbroken phase in front of the wall. The opposite situation takes place behind the 
wall in broken phase. The moving wall separates particles of different types. 

Once the inequality (11.105) holds, the asymmetry behind the wall does not lead 
to baryon and lepton number violation. In front of the wall it does: even though the 
overall fluxes of baryon and lepton numbers are zero according to (11.124), there 
is deficit of left fermions there. These are precisely the particles that participate in 
electroweak sphaleron processes. Hence, the latter processes tend to wash out the 
deficit of left fermions, thus partially equilibrating the system and generating the 
net baryon asymmetry. 

The asymmetric fermion reflection changes the medium locally, at distance | 
from the wall. (We will soon estimate the value of l.) At a given point of space, the 
time interval from the appearance of the asymmetry to the moment when the wall 
passes through that point equals t = l/vw. The baryon number violating processes 
occur precisely during this time interval. To estimate l we notice that in time t, the 
reflected particle experiences t/t, collisions and, according to the Brownian law, 


362 Generation of Baryon Asymmetry 


moves away to distance 
lala 3 (11.125) 
ty 
where ty and lp are mean free time and mean free path of the particle; lf = ty in 


our relativistic case. Together with t = l/vw, Eq. (11.125) gives 


pot, ga (11.126) 


Uw v2, 


The excess of left particles generated in the region of size l by collisions with the 
wall in time t (per unit wall area) is of order 


N, = J, +t, 
so the excess number density is 
E 7J. 
i Sys = = (11.127) 


The sphaleron processes are typically so slow that they wash out a small fraction 
of the excess (11.127). Then (see (11.114)) 


dng HL 
=T; ; 
dt eee 


where for relativistic particles u, ~ n,/T?. This gives 
ind 
lp™~ pa T sph t, 
where the time interval during which the sphaleron processes work is given by 
(11.126). Combining formulas (11.123), (11.124), (11.126), (11.127) and recalling 


that the entropy density is s ~ gT’, we obtain the baryon asymmetry 
n 1 tf T sph — 


v2,.gx ly T4 [Rs 7 rl pent) 


(11.128) 


Now, the mean free time of left leptons is of order 
tp ~ (awT). 


The mean free time of quarks is shorter due to collisions caused by strong inter- 
actions, so the most important? is 7-lepton: indeed, according to (11.122), the 


20Our analysis does not apply to t-quark, in particular because left t-quark quickly becomes right 
due to the Yukawa interaction. Hence, t-quark does not play major role in the mechanism under 
study. 
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asymmetry is larger for larger hy = “. We use Pspn = x'a% T4, ly ~ T7! as well 
as the estimate (11.122) in (11.128) and find for 7-lepton 


{nyo 


un Qw 2 
5 ft HOP, 
wdI* 


Ar~ 


2 
uir 


where h2 = Z4 ~ 1074. For vy ~ 3- 107? (which is quite realistic) and 3’ ~ 18 we 
obtain numerically 


Az ~ 107 ‘Oop. (11.129) 


This is sufficient for the generation of the realistic asymmetry. 

In our analysis, we have not taken into account a number of important effects 
such as dynamical masses of particles, conservation of (B—L) and weak hypercharge, 
Debye screening of gauge charges in the plasma, etc. Like in Sec. 11.5.2, diffusion is 
also important. Nevertheless, our order of magnitude estimate (11.129) is roughly 
correct, so the mechanism we described is indeed efficient. 


Problem 11.20. At what wall velocities is the approximation of slow sphaleron 
processes, made in the text, valid? 


Problem 11.21. At what wall velocities are the transitions of left T-lepton into 
right one, Te + Z > Tr +h, irrelevant? 


Let us refine the estimate (11.122). To do that, we need to solve the Dirac equation for the 
fermion that interacts with the wall according to (11.118). Proceeding to work in the wall 
rest frame, we make the Lorentz transformation along the wall after which the fermion 
motion is normal to the wall. We use the chiral (Weyl) representation for y-matrices. In 
our reference frame 0:~ = O2wW = 0, so the term (11.118) leads to the following form of 
the Dirac equation, 


iðoqr + io”ðzqr + m*(z)qz = 0, (11.130) 
100g — io Ozqr + m(z)qr = 0, (11.131) 


where m(z) = hipe’’. The wave function of left fermion incident on the wall from the left 
(i.e., from unbroken phase where m(z) = 0), is 


; safes 0 
qi” = qo ett 7 (o) zZ— —00, (11.132) 


where p = w > 0. Indeed, this is precisely the solution to Eq. (11.131) with m = 0. To cal- 
culate the reflection coefficient of left fermion, we have to find the solution to Eqs. (11.130), 
(11.131), which contains the incoming wave (11.132), outgoing wave at z + —oo that moves 
to the left, and only outgoing wave at z — +00. We see from Eqs. (11.130), (11.131) that 
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the solution has the structure 


qu =e p(z); (') » gr=e dbr(z)- a: 


The complex functions Yz and Yr obey 


~idzp + wip +m*(z)b, = 0, (11.133) 


The reflected wave behaves at z > —oo as w œx e’?*. We see that the reflected wave is 
right, since Eq. (11.134) with m = 0 does not admit solutions of this form. This general 
result is actually a consequence of angular momentum conservation. Hence, the reflected 
wave is 


Wr = Ae, 2-00, (11.135) 


where A is the reflection amplitude, and p = w. To find the amplitude A we make use of 
perturbation theory in m(z). At zeroth order we have Yr, = e '?*, Wr = 0. At the first 
order, Wr is determined from Eq. (11.133), which gives 


—10.0n + wr = —m*(z)e’?*. 


The general solution of the latter equation is 
Yr =e 7 |- f m* (2) dz! +c], (11.136) 
20 


where z and c are arbitrary constants. We choose the constant c in such a way that 
there is no left-moving wave beyond the wall, i.e., at z — +00. We choose zo to be in the 
region behind the wall. There we have m* = my (without loss of generality, hı is real and 
(z — +00) = 0), and the solution at z — +00 is 


Wr = a el = e PZO) +e. 
pP 


Hence, the requirement of the absence of incoming wave at z — +00 gives 


Mf 2ipzo 
—— e : 
2p 


The solution (11.136) indeed has the form (11.135), and we find 


A= -i f m*(z)e""?*dz — i (11.137) 
zo 


where we have to take the limit z — +00. The reflection amplitude A for antiparticle has 
the same form with m* (z) substituted for m(z). 


Problem 11.22. Prove the last statement. 
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The reflection coefficient for left fermion Rz is equal to the modulus of the amplitude 
(11.137) squared. We are interested in the asymmetry 


R: — Ri = |AP - |A}. (11.138) 
To proceed further, we write the amplitude (11.137) in the following form, 
+00 , 
a=if [m*(z) — myO(z)] e?”? dz — a, 
—0o p 


where ©(z) is the usual step function, and we have taken the limit zo — oo. This is the 
final analytical result for the amplitude. To estimate the asymmetry (11.138), let us assume 
that the function Re[m(z)] — msO(z) is symmetric in z. Then 


M ia 2ipz 
A= -3p + Im[m(z)]e"”* dz, (11.139) 
where 
+00 
M=mf + af {Re|[m(z)] — mfO(z)} sin(2pz)pdz. 


We note that there is an order of magnitude relation 


M~ ms. (11.140) 
From (11.139) we obtain the asymmetry 
= IM fT” 
Rz — R; = = Im[m(z)] cos 2pzdz. 
Finally, we introduce 
iL. J" 
Ocp(p) = f Im[m(z)] cos 2pzdz, (11.141) 
lwmf =o 
and obtain 
7 2Mmylw 
R: — Rt = -t bor(p). 


Because of the oscillatory factor cos 2pz, the integral in (11.141) rapidly tends to zero at 
large momenta, p >> lw, while at p < lj," it is determined by the phase 0(z) of the function 
m(z). Making use of (11.140) we obtain the estimate 


2 
= 2mFlw 


Ri-Ri~ Oop, pz Sa’, (11.142) 


z 


where we recalled that p is the momentum normal to the wall. This estimate is independent 
of pz, py, as the Lorentz boost along the wall does not change pz. We stress that (11.141) 
contains fermion mass in the broken phase, mf = h1®-. The estimate (11.122) is obtained 
from (11.142) at p- ~ lp" by assuming that the wall thickness is of order ®7* (i.e., by 
assuming the validity of (11.119) and (11.120)). 

The asymmetry in reflection of right particles is 


Rr — Rr = — (Rz — R:). (11.143) 


Problem 11.23. Prove the last statement in the general case by making use of the prop- 
erties of Eqs. (11.130) and (11.131). 
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11.5.4. Electroweak baryogenesis, CP-violation and neutron EDM 


To end this Section we notice that both electroweak mechanisms we have discussed 
employ extra sources of CP-violation as compared to the Standard Model. This 
extra C'P-violation is in fact interesting from the viewpoint of precision particle 
physics experiments. 

One class of experiments sensitive to new CP-violating phases is the search for 
electric dipole moments (EDMs) of electron and neutron, de and dn. EDM d is 
the parameter of the Hamiltonian describing the interaction of spin S with electric 
field E, 

H=-d-E.- S 


The corresponding relativistic Lagrangian for electromagnetic interaction of fermion 


w is 
pe =a rye PF uv. (11.144) 


EDM violates P and T (see Sec. B.3) and therefore CP. EDM of composite par- 
ticle, neutron, is of the order of EDMs of quarks in it, dn ~ dy,dqa. The present 
experimental limits are [1] 


de < 1.05 - 107” . e - cm = 0.5 - 107? . e - GeV}, (11.145) 
dn < 2.9 - 107° . e - cm = 0.32 - 1071! -e- GeV, (11.146) 


where e is the electron charge. In the two-doublet model (11.107), the additional 
CP-violation in the Englert-Brout-Higgs sector induces quark EDM via one-loop 
diagram of Fig. 11.16, where H denotes all Higgs fields, and summation over these 
fields is assumed. The estimate for this diagram is 

e Mg Uuqi Uju YuYq; 


du ~ Oce 


, (11.147) 
(47)? mi 


Fig. 11.16. EDM of u-quark at one loop level. 
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where my is the mass scale of the Higgs bosons, 0cp is the C’P-violating phase, 
U is the quark mixing matrix (so that the Yukawa matrix is Y = Yaiag-U), (4m) ~? 
is the loop factor, and the quark mass mg, comes about due to the fact that the 
Lagrangian (11.144) violates helicity. 

The dominant contribution into (11.147) comes from virtual t-quark, so for 
Uut ~ 1 we obtain 


1 TeV 


MA 


2 
dn ~ dy ~ Oop: 1.6- 107" . ( ) e- GeV™!. (11.148) 
By comparing (11.148) to (11.146) we see that the phase 0cp must be fairly small, 
cp < 107% for my ~ 100 GeV, provided that mixing in Higgs-quark interactions 
is of order 1. Similar bounds on C’P-violating phases exist in other models. These 
bounds are completely independent of the baryon asymmetry. On the other hand, 
electroweak baryogenesis requires sizeable C'P-violating phases. Thus, most models 
with successful electroweak baryogenesis predict EDMs comparable to the limits 
(11.145), (11.146). As an example, the result (11.116) for the asymmetry in the 
model of the previous Section suggests that 0c¢p ~ 107?. Making use of the estimate 
(11.148) we obtain the prediction 


dn ~ (10° — 107") - ( 


This is close to the experimental limit for realistic mass scale of the Higgs bosons, 
say my ~ 200 GeV. The general conclusion is that the new mechanisms of CP- 
violation required for electroweak baryogenesis will most probably be indirectly 
discovered in the EDM experiments of the next generation. 


11.6. *Affleck—Dine Mechanism 


11.6.1. Scalar fields carrying baryon number 


Some extensions of the Standard Model contain not only quarks, but also scalar par- 
ticles that carry baryon number. Scalars carrying lepton numbers are also possible. 
These particles have not been found experimentally, so they must be quite heavy;?! 
roughly speaking, their masses must exceed a few hundred GeV. Yet the dynamics 
of the corresponding scalar fields in expanding Universe may lead to baryon asym- 
metry generation; the class of baryogenesis scenarios of this sort is generically called 
the Affleck—Dine mechanism [212]. Certainly, the necessary conditions of Sec. 11.1 
must be satisfied in this case as well. In particular, baryon number should not be 
exact symmetry, now of the Lagrangian of these scalar fields; CP must be violated 
as well. 


21 Assuming that their interactions with Standard Model particles are not particularly weak. 
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To construct a prototype model, let us extend the Standard Model by adding a 
complex scalar field ¢ carrying baryon number, By # 0, and a fermion w with zero 
baryon number. The kinetic and mass terms in the action for Y have the standard 
form, while the action for the scalar field is chosen as 


Sp = | dtey=5lg 8,480- V(O), (11.149) 
where 
2 1% A * 1\2 N 4 x4 
V(O) = mgo + ŽA) + ŽO + 0"). (11.150) 


The parameters \ and à’ are real and positive,?? and we assume that AX < X. 
Finally, there is a Yukawa term involving ¢, ọ and a spinor combination q of quark 
fields, 


Lint = had + h.c., (11.151) 


where h is the Yukawa coupling. In fact, q may be (and often is) a composite 
operator, a product of the Standard Model fields; it may or may not carry color. 
The only important property is that the operator q carries baryon number By, such 
that 


Bg = Ba, 


and the Yukawa term (11.151) conserves baryon number. We do not discuss here 
the transformation properties of the fields ¢ and ~ under the Standard Model gauge 
group SU(3). x SU(2)w x U(1)y; the relevant representations can be straightfor- 
wardly found. 


Problem 11.24. Find the representations of ¢ and yw under SU(3). x SU(2)w x 
U(1)y and the composite operator q such that the full action is gauge invariant. 


Were the last term in (11.150) absent, the model would be invariant under global 


phase rotations?’ 


p — Pep, q> teg, poy. 


The corresponding conserved quantum number is nothing but baryon number, and 
the baryon number density is 


ns = iBy(:d* 6 — 6 - DG) + Nag (11.152) 


where ns, = $(n4 — ng) is the baryon number density of quarks. For small but 


finite constant ’ in (11.150) baryon number is conserved only approximately. 


Problem 11.25. Obtain the expression (11.152) from the Noether theorem. 


22 The third term in (11.150) is, generically, (agt + h.c.) with complex A’. However, by redefining 
the field ¢ it can be cast into the form (11.150). 
23 Modulo electroweak anomaly which is irrelevant here. 
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The situation similar to what has been just described occurs in supersymmetric extensions 
of the Standard Model; see Sec. 9.6. The composite field ¢ is a combination of squarks, 
sleptons and Higgs fields, while the field ~ is a combination of gauginos. There is indeed 
the interaction of the form (11.152), and up to numerical factor, the coupling h coincides 
with the gauge coupling gs of the color group SU (3)e. The interaction term \/(¢* + h.c.) 
is forbidden by gauge invariance under SU(3)-, but higher order interactions violating 
baryon number and (B — L) are allowed (e.g., \’¢°). (B — L)-violation is important, since 
the generation of (B — L) asymmetry is necessary and sufficient for the generation of 
the present baryon asymmetry, once one employs mechanisms operating at temperatures 
above 100 GeV. 

A generic feature of SUSY extensions of the Standard Model is the existence of flat 
directions (moduli) in the field space, along which the scalar potential is small up to 
large values of the scalar fields. In terms of the potential (11.150) in which ¢ is the field 
parameterizing flat direction, this means that the mass m is low while the constants À and 
X are extremely small. 


11.6.2. Asymmetry generation 


Let us continue with the model described above. Let the field ¢ at initial time (right 
after the end of inflation in inflationary theory) be spatially homogeneous and take 
some complex value ¢; = r;et® / /2. The analysis of further evolution is simple in 
the case 


m?|i|? > Aldil*. (11.153) 


Let us discuss this case in some details; the opposite case is the subject of Prob- 
lem 11.27. We know from Sec. 4.8.1 that the field stays practically constant for some 
time during which the slow roll conditions are satisfied. Once these conditions get 
violated, the field, remaining homogeneous, evolves towards the minimum of the 
potential, o = 0, and in a few Hubble times comes to the vicinity of this minimum. 
Near the minimum, real and imaginary parts of the field evolve independently (the 
potential is quadratic), and each of them evolves according to (4.63), i.e., 
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a372 
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Re d= op = 


cos(mt + Br), 


— 


(11.154) 


Note that for time-independent scale factor a the trajectory (11.154) in the field 
space is an ellipse in complex plane (for 3; 4 Br). In reality, the scale factor a(t) 
grows, so the ellipse turns into elliptical spiral, see Fig. 11.17. The presence of the 
baryon number violating term in (11.150) (the term with A’) is important: were 
it not for this term, the phases Gr and 8r would be equal, and the ellipse would 
degenerate into interval. 
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Fig. 11.17. Trajectory ¢(t) in complex plane. 


The field (11.154) carries baryon number density 
ns = 1By(OG* -$ — 6°0:4) = 2Bo(bndvdr — 61046) (11.155) 


which is equal to 


mC,Cr. 
ns = 2Be ay sin(Gp — br). (11.156) 
In the absence of interactions with quarks, baryon number density decreases as 


Ng X a~’ 


, so baryon number is conserved in comoving volume (baryon number 
violation due to the last term in (11.150) is negligible at small ¢). Interaction 
with quarks (11.151) transmits this baryon number into quarks, and the Universe 


becomes baryon asymmetric in the end. 


As we noticed in Sec. 4.8.1, the field (11.154), in quantum language, describes a coherent 
state of ¢-bosons and their antiparticles at rest. The ¢-boson and its antiparticle carry 
baryon number By and (— By), respectively. Non-vanishing baryon number (11.156) means 
that the numbers of ¢-bosons and their antiparticles are different. The decay of the oscil- 
lating field ¢ into quarks and fermions ~ may, with reservations, be viewed as decays of 
o-particles and their antiparticles. In this language, the generation of asymmetry between 
quarks and antiquarks in the decay of the field (11.154) is completely obvious. 


Let us now turn to the estimate of the generated asymmetry. It is convenient to 
rewrite the action (11.149) in terms of variables r and 0, defined by 


T i0 
= —=e". 
ey 
The action for the homogeneous field in expanding Universe is 
3 Lo Lope 
Spo = | dta°(t) z” + z” 0° —Vi(r,0)}, 
where 


2 
V(r, 6) = or? Art 


XN 4 
= — 40. 11.1 
5 37 + gr cos (11.157) 
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This gives the equation for 0, 
(ar?) = ——.. (11.158) 


The expression for the baryon number density (11.155) has the following form in 
terms of the variables r and 9, 


Ng = Bor’. 
Hence, Eq. (11.158) reads 


10,5 av 


If the potential V were independent of 0, this would be the baryon number conser- 
vation law for comoving volume. For the 6-dependent potential Eq. (11.159) is the 
equation describing the generation of baryon number. 

Let us assume that the initial phase 0; is nonzero. This means that the initial 
state is CP-asymmetric; it is in this place where one of the necessary conditions for 
asymmetry generation shows up. This assumption is important, as the evolution for 
6; = 0 proceeds along Im ¢ = 0 (recall that A’ is real), and baryon number density 
(11.155) is always zero. For non-vanishing 6;, the baryon number density at later 
time t is 


t 
a? (t)ns(t) = -B | ag? (t’)dt! 5 J r (t'ja? (t) sin 40(t') dt’. (11.160) 
ti ti 
Since rfa? x a~? œ t73/?, the integral in (11.160) is saturated at lower limit of 
integration, i.e., in a few Hubble times?4 after the field starts rolling down towards 
$ = 0: before that, OV/08 is constant but a(t) is small; after that OV/00 x r* is 
small (proportional to a~®, see (11.154)). Note that we assume that the evolution 
of the field ¢ occurs at the hot epoch. We proceed to work under this assumption, 
which implies, in particular, that the energy of the field @ never dominates in the 
Universe. We discuss the condition for that later on. 
Equation (11.160) immediately gives the estimate 
a(t) OV 1 


mal) ~ ay oa’) HCE)’ 


where t, is the time at which the field starts to roll down. We recall that a(t) x Tt 
and obtain the estimate for asymmetry, 


1 4 
2 ad A = sin 46;. (11.161) 
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24t is at this point that the assumption (11.153) simplifies the analysis. 
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The relation (11.153) shows that the slow roll conditions get violated when 
m? ~ H?. (11.162) 


Making use of this relation, we write the estimate (11.161) as 


„\ 5/2 82 
Ap ~ Ng "^ (=) (z) sin 40;. (11.163) 


Clearly, the resulting asymmetry strongly depends both on parameters of the model 
(mass m and coupling A’) and on the initial amplitude r; and phase 6; of the scalar 
field ¢. Hence, the value of A, appears to be the result of a choice of initial data. 

To find out under what conditions the expression for the asymmetry (11.163) 
gives the required value A, ~ 107!°, we have to specify the initial data r; and 6;. 
One possible (but not at all unique) choice is to assume that the initial phase is not 
small, sin40; ~ 1, while the initial amplitude is of the order of the Planck scale, 
ri ~ Mpi. Then one finds from (11.163) and m < Mp; that the required asymmetry 
is obtained only for very weak baryon number violation, \’ < 10~!°. Otherwise the 
asymmetry is too large. This result illustrates rather general property of the Affleck- 
Dine mechanism: it often (though not always) generates too large asymmetry. 

At ri ~ Mp), the assumption that the slow roll regime terminates when the 
Universe is already at the hot stage of its evolution is valid for quite small m and 
very small A only. Let Tmas be the maximum temperature in the Universe. Then 
the relation (11.162) indeed holds at T; < Tmas only if 


2 
m < z, 


M3, 

For Taas ~ 1015 GeV (which is the order of magnitude of the maximum possible 
reheating temperature in realistic inflationary models; see the accompanying book) 
and r; ~ Mp; we have 


m < 10% GeV. (11.164) 


Furthermore, our assumption that m?|¢|? >> A|¢|* holds only for A < 10714, while 
the required baryon asymmetry is obtained from (11.163) at \’ < 10777. These 
estimates illustrate another property of the Affleck—Dine mechanism: it needs almost 
flat scalar potential V(¢) for the field ¢. As we pointed out, almost flat potentials 
are rather natural in supersymmetric theories. 

Let us discuss one more property of the Affleck—Dine mechanism, still assuming 
that r; is roughly of order Mp;. It is clear from (11.162) that at the beginning of 
oscillations, the energy of the field @ is roughly comparable to the energy of hot 
matter, 


polt) mr n 


a 


Prad(tr) H? M3, ~ M3, 
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The energy density of the oscillating field (11.154) decreases in the same way as 
that of non-relativistic matter, pọ « a~3(t), see (4.69), and the decay of oscillations 
occurs at H ~T (see Sec. 5.3), where I is the decay width of the ¢-boson. For T « m 
(which is consistent with the fact that ¢ has tiny self-interaction) this implies that 
the energy of the oscillating field starts to dominate in the Universe soon after 
the oscillations begin. The existence of the intermediate matter-dominated stage is 
another characteristic property of the Affleck—Dine mechanism. 


Problem 11.26. If there is indeed the intermediate matter-dominated stage, the 
estimate (11.163) is not valid. Estimate A, in that case, still assuming that slow 
roll conditions get violated at radiation domination. 


Problem 11.27. Consider the case 
AI di|* > m?|d7]. 


(a) Show that the amplitude of oscillations in the case of quartic potential, V = 
A¢*, decreases as a~! (as opposed to a~*/? for quadratic potential). This means 
that OV/00 in the integrand in (11.160) decreases as a~* until the time when 
Art ~ m?r?, and only afterwards it decreases as a~®. 
(b) Making use of this observation and assuming that the energy of the field @ never 


dominates in the Universe, show that the resulting asymmetry is of order 


OD fig eee eg a “an. 
m3/4 Pl ‘ 


(c) Show that for ri roughly of order Mp, the required properties of the theory (small 
m, very small X', etc.) are qualitatively the same as in the case considered in 
the text. 


Thus, the Affleck—Dine mechanism can successfully work at the hot stage, albeit 
in a rather restricted range of parameters. The scalar potential must be quite flat, 
and the maximum temperature in the Universe quite high. An interesting possibility 
here is that the Universe is temporarily dominated by the oscillating scalar field, 
and hence experiences intermediate matter dominated stage. 

If the maximum temperature in the Universe is not very high, and the mass of the 
field ¢ is relatively large, then the field ¢ rolls down to the minimum of its potential 
before the hot stage. This is a fairly realistic situation. In that case the Affleck- 
Dine mechanism may work at post-inflationary reheating epoch. The corresponding 
estimates are different from those given above, but the general conclusion on the 
possibility of the baryon asymmetry generation remains valid. 

Let us now discuss a version of the Affleck—Dine mechanism [213], in which the 
initial value r; is determined dynamically, while the flatness requirements for the 
scalar potential are less severe. Let us assume that besides the terms written in 


374 Generation of Baryon Asymmetry 


(11.149), there is one more term in the action, 


Sao = - | dxy=Ger|o?, (11.165) 


where c is a positive dimensionless constant which we choose somewhat larger (but 
not much larger) than 1. For the FLRW Universe one has 


R = —8rGT} = —87G(p — 3p). 
The initial value of ¢ is determined by its dynamics at inflationary epoch, when 
p% —p and R = —12H?, so the extra term effectively changes the mass term in the 
potential, 


m? ||? > (m? — 12cH?) ||’. (11.166) 


At inflation one has H >> m, the potential has maximum at ¢ = 0, and in the limit 
A — 0 there is a valley of minima on a circle in complex plane, 


= (11.167) 


Slow roll conditions are not satisfied in this case for radial motion (this is the reason 
for the choice c > 1), so r(t) is approximately given by (11.167). At small but finite 
A the valley (11.167) is slightly inclined, the potential depends on the phase 0, but 
this phase is almost flat direction. The evolution along this direction occurs in the 
slow roll regime. 


Problem 11.28. Show that for H = m and X < X the evolution of the phase 0 
occurs in the slow roll regime. 


At radiation domination one has p ~% tp, so that R « H?, the contribution 
(11.165) is irrelevant and the dynamics is described by the scalar potential (11.150). 
If the reheating after inflation is fast and happens in a few Hubble times, then right 
after inflation the field ¢ rolls down towards ¢ = 0, as described in Problem 11.27. 
The initial value of ¢ is determined by the Hubble parameter at the end of inflation, 
which in the instant reheating approximation coincides with the Hubble parameter 
at the beginning of the hot epoch, 


24cT4 
2 man | 11.168 
Ti AM5? ( ) 
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As long as the fourth order term dominates in the scalar potential, r(t) decays as 
a`! œ T; see Problem 11.27. At time tm, when \r? ~ m?, this regime terminates, 
the field starts to decay faster, r x a~/?, and the asymmetry generation terminates. 
At that time, the temperature is estimated as follows: 


MT max 


Tm a Mar, : 


(11.169) 


As a result, we obtain from (11.160) that the asymmetry is 
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where tren is the time the hot epoch begins and temperature equals Tmar. To 
evaluate the integral, we have used the relation (3.34). We recall Eq. (11.169) and 
then Eq. (11.168) and obtain finally 
/ T5 
Arp w (24 5/2 B L mar 
B EE E, 
We see that if Tmar is not very low, we do not need very flat potential; as an 
example, even for Tmax ~ 101? GeV, AX = 0.1A, m = 1 TeV, the required baryon 
asymmetry is obtained at \ ~ 1074. 


Problem 11.29. Show that in the latter scenario, the initial value r; is typically 
relatively low, ri < Mp). This means, in particular, that the term (11.165) is small 
compared to the Einstein—Hilbert term, and gravity does not get modified. 


Problem 11.30. Study the latter scenario at c > 1, where c is the parameter in the 
action (11.165). Can this mechanism work at the hot epoch and yield the required 
baryon asymmetry for Tmax < 10° GeV? 


11.7. Concluding Remarks 


The baryon asymmetry generation mechanisms considered in this Chapter by no 
means exhaust the possibilities proposed in literature. Among other things, the 
asymmetry generation may occur at post-inflationary reheating epoch, rather than 
at the hot stage. Unfortunately, many mechanisms (like those studied in Secs. 11.3 
and 11.4) make use of physics beyond the energy scale accessible at future colliders. 
Hence, direct experimental evidence in favor of one of these mechanisms will be 
hard if not impossible to obtain. The exception is the electroweak mechanism that 
will be supported or falsified by collider experiments in near future. Evidence for 
leptogenesis through neutrino oscillations could come from the discovery of sterile 
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neutrinos with masses below the electroweak scale; this would be possible at exper- 
iments of new generation. As to the Affleck—Dine mechanism, it may be supported 
by the discovery of baryon isocurvature perturbations in the spectrum of density 
perturbations in the Universe.?° Search for isocurvature perturbations is one of the 
goals of the studies of the CMB anisotropy and polarization, galaxy surveys, etc. 
We consider these issues in the accompanying book. 


25 Generation of baryon isocurvature perturbations is a possible, but not necessary consequence of 
the Affleck—Dine mechanism, so this mechanism cannot be ruled out by the cosmological data. 
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Chapter 12 


Topological Defects and Solitons 
in the Universe 


In this Chapter we consider cosmological aspects of field theory models which admit 
soliton or soliton-like solutions. These solutions are specific (sometimes macroscopic) 
field configurations whose stability is ensured either by non-trivial topology of the 
space of vacua (topological defects) or by the existence of conserved global quantum 
numbers (non-topological solitons, Q-balls). Of interest are both particle-like soli- 
tons (monopoles, @Q-balls) and extended objects (cosmic strings, domain walls); we 
will jointly call them defects or solitons in what follows. These objects are absent 
in the Standard Model (modulo exotic proposals like quark nuggets), but they nat- 
urally exist in some extensions of the Standard Model. Generally speaking, there 
may be also unstable solutions whose lifetime is comparable to the Hubble time. 

We point out right away that no undisputable evidence for defects of any sort in 
our Universe has been found so far. Nevertheless, their study is of considerable inter- 
est. We will see that there are numerous effects occurring in models with defects, 
including peculiarities of the expansion history, new mechanisms of structure forma- 
tion, lensing of distant sources, features in CMB angular anisotropy, new processes 
generating the baryon asymmetry, etc. The energy scale of the defect formation is 
considerably higher than the electroweak scale. Experimental discovery of defects 
would thus give evidence that the temperature in the Universe reached that energy 
scale.! Furthermore, this discovery would show the existence of physics beyond the 
Standard Model and give a hint on what this new physics is. This would open up a 
window to energy scales well exceeding the reach of particle colliders. 


A general reason for the existence of topological defects in field theoretic models is the 
non-trivial topology of the space of vacua in these models. This means that the ground 
state — vacuum — is not unique, and, furthermore, some homotopy group my of the 
manifold of vacua M is non-trivial, 


wn(M) 40. (12.1) 


lIn fact, there exist mechanisms of defect production that do not require so high temperature. 
But in any case, the energy density at the time of defect production must be very high. 
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In other words, there exist non-trivial mappings from N-dimensional sphere S* to the 
vacuum manifold M. In most cases the defect configuration corresponds to a non-trivial 
mapping from spatial asymptotics (sphere S” in the general case) to vacuum manifold 
(since the fields tend to vacuum values at spatial infinity, otherwise energy would be 
infinite). In space-time of dimension d + 1 the property (12.1) suggests the existence of 
topological defects of space-time dimension d — N. Stability of these defects is due to 
the fact that destruction of a defect would require rearrangement of the fields at spatial 
infinity; that would cost infinite energy. Non-trivial configurations thus have topological 
charges which are conserved. In the case of 4-dimensional space-time, three types of defects 
are possible, whose dimensions are 2 + 1 (walls), 1 + 1 (strings) and 0 + 1 (particle-like 
defects, e.g., monopoles). There may exist hybrid defects, including combinations of defects 
of different dimensionality. 

We note that the relevant symmetry of a theory may be local (gauge) or global. Accord- 
ingly, the defects are called local or global. In the former case the energy is localized. On the 
contrary, in the global defect case, the gradient energy falls off with distance so slowly that 
the energy integral diverges. In the physically relevant situation with numerous defects this 
means that one cannot neglect the interaction between them. These interactions render 
the energy density finite. 


12.1. Production of Topological Defects in the Early Universe 


A rather general property of field theory models with topological defects is that the 
defects exist only in the phase with spontaneously broken symmetry, whereas they 
are absent in the unbroken phase. We will encounter this situation in Secs. 12.2-12.5. 
As we discussed in Chapter 10, symmetry is usually restored at high temperatures, 
and the broken phase appears in the Universe at lower temperatures as a result of 
phase transition. Thus, the existence of topological defects is possible only after this 
phase transition, i.e., at T < T}, where Te is the phase transition temperature.” 

Defects may be produced after the phase transition thermally, in collisions of 
particles. This mechanism is often inefficient. If defects equilibrate, their density is 
small. Indeed, considering for definiteness particle-like defects like monopoles (see 
Sec. 12.2), we quote the result that their mass Mr p is typically much larger than the 
critical temperature, so the equilibrium abundance is suppressed by the Boltzmann 
factor 


M: 
nied) xe te. (12.2) 


Strings and domain walls are suppressed even stronger. 

There exists, however, another way of producing topological defects, which is 
called Kibble mechanism [226]. We will consider this mechanism in detail in Sec. 12.2 
where we study magnetic monopoles, but we will see that it is quite general. This 
mechanism works at the phase transition epoch for topological defects. The phase 


2In fact, the production of topological defects is possible at both thermal phase transitions occur- 
ring at the hot Big Bang stage and non-thermal phase transitions that may take place, e.g., at 
post-inflationary reheating. 


12.2. °t Hooft—Polyakou Monopoles 379 


transition proceeds independently in regions of the Hubble size. Inside a Hubble 
region the field configurations may be correlated, so the system may end up in 
one and the same vacuum. On the other hand, vacua in different Hubble regions, 
generally speaking, are different points of the vacuum manifold. Hence, the topology 
of the field configuration at the spatial scale exceeding the Hubble scale may coincide 
with the defect topology. The relaxation processes after the phase transition do not 
change the topology, so the result is the formation of a defect. This is precisely 
the Kibble mechanism. It is clear from the above reasoning that this mechanism is 
quite universal; it works for defects of different types and depends rather weakly on 
Mrp/T.. 

More precisely, the number density of defects produced by the Kibble mechanism 
is determined, right after the phase transition, by the correlation length leor, which 
definitely does not exceed the Hubble size,? H~1+(t.). So, the number of defects is 
at least of order 1 per the Hubble volume, 


TS 
nrp(te) ~ lor Z H? (te) = Me 3° 
PI 


(12.3) 
This estimate is general for point-like topological defects, modulo a numerical factor. 
(The latter may be quite different from unity, though.) For extended defects (cosmic 
strings, domain walls), the above argument applies to the distance lp(te) between 
them, i.e., one has Ip(t.) £ H~! (te). 

Further evolution is specific for defects of different types. As arule, this evolution 
is out of thermal equilibrium: the equilibrium number density of defects falls off 
exponentially, while the actual density exhibits power-law behavior. The evolution 
of various defects will be considered in the following sections, and here we mention 
one property. Right after the phase transition, defects do not affect the expansion 
rate, since their energy density is smaller than that of hot matter by a factor T/M% 
where the positive exponent a depends on the type of defects. (It follows from (12.3) 
that a = 3 for point-like defects.) However, at later times the energy density of 
defects may become significant and may even dominate the expansion. 


12.2. °t Hooft—Polyakov Monopoles 


12.2.1. Magnetic monopoles in gauge theories 


The simplest model admitting monopole (and antimonopole) solution [227, 228] is 
the Georgi-Glashow model. This is the gauge model with gauge group SU(2) and 
triplet of scalar fields $°,a = 1,2,3 which transform under adjoint representation. 


3In the case of the first order phase transition, leor is the typical size of bubbles at percolation. 
For the second order phase transition, leor is the size of a region in which the energy needed to 
unwind the defect is of the order of temperature. 
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The Lagrangian is 
L= _} pa pow + lp oD" gb? — r (get =y. (12.4) 
a Ta 4 i l 
where 
Fisy = 3p Ap — OAT, + ge" AP AS, 
Dud® = pG + ge A? Ge. 
The last term in (12.4) is self-interaction of the scalar fields leading to spontaneous 
symmetry breaking. The vacuum energy density is determined by minimizing the 
scalar potential, so that 
(24%) = v. (12.5) 


This equation determines the vacuum manifold; in our case it is a 2-sphere S?,,. 
Each point in this manifold is invariant only under the subgroup U(1) of the gauge 
group SU(2). Hence, the symmetry breaking pattern is SU(2) — U(1). We identify 
the unbroken U(1) as the gauge group of electromagnetism in this toy model. 

By choosing the vacuum as 


(9°) = 03%, (12.6) 


we find that the field A? remains massless (gauge field of unbroken U(1), photon), 
while the fields 


E 1 J: + N2 3 
Wr = (A, £74,) and h= -v 


obtain masses 
my =gv and Mp = V2X0, 
respectively. 


Problem 12.1. By writing the quadratic action for small perturbations about the 
vacuum (12.6), find the spectrum of the theory, thus confirming the above statements. 


Let us study static field configurations of the form 
Ag =0, A= Aix), $° = ¢°(x), 


and require that the energy is finite. The energy functional is 
3 a pa a a aya 2)\2 
B= fex trang + spore" + Meroe]. an 


Hence, at spatial infinity (r — oo, where r°? = x”) the fields tend to vacuum values, 
ppe = v?, Af, = 0, up to gauge transformation. Furthermore, the convergence of 
the energy se requires sufficiently fast decay at infinity of the gauge-covariant 


i Dig” and (¢°¢*% — v oe 


quantities F 
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Fig. 12.1. Asymptotic behavior of the Englert-Brout—Higgs field for the monopole configuration. 
The direction of the scalar field in internal space (shown by arrows) is the same as the direction 
of the radius vector in the physical space. 


The length of the vector (#1, ¢7, °) is equal to v at spatial infinity. Hence, any 
finite energy configuration defines the mapping from a sphere $2, at spatial infinity 
to vacuum manifold $2... Since 


mə(S?) =Z £0, 


this mapping may be topologically non-trivial, and the configuration topologically 
stable. 

The simplest non-trivial mapping is given by the hedgehog configuration of the 
scalar field, see Fig. 12.1, 


re 
— 12: 

- (12.8) 
The energy (12.7) is finite provided that the asymptotics of the vector field is 


i 
Af (x) = —e nj. (12.9) 
gr 


Thus, we write the following Ansatz, 


ge? =un*-(1—f(r)), 


il ga. 
a — ijn j(y — 
AS af nj(1—a(r)). 


(12.10) 


This Ansatz is invariant under spatial rotations supplemented by rotations in the 
internal space, i.e., by global SU(2)-transformations. The functions f(r) and a(r) 
must obey the requirements of the absence of singularity at the origin and finiteness 
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of energy. This gives 
f(r > 00) = a(r > œ) = 0, 
[l-f(r-0)) xr, [L-— a(r 40)] x r?. 


The actual configuration of the soliton is obtained by solving the field equations in 
terms of f(r) and a(r). 


Problem 12.2. Show that the Ansatz (12.10) is consistent with the field equations. 
Show that at g ~ VX, the functions f(r) and a(r) decay exponentially at spatial 
infinity. 


The soliton just described is a magnetic monopole. Indeed, according to (12.9), 
the electric components of the field strength vanish, while the magnetic components 
are 

a 1 i 1 
B; = Tgiik Tjk = gree (12.11) 
Their direction in the internal space coincides with the direction of the scalar field, 
so this field is actually the magnetic field of the unbroken electromagnetic U(1) 
gauge group. The gauge-invariant field strength 


B; = B? =-+ (12.12) 


equals the magnetic field of magnetic monopole of magnetic charge gm = 1/g. The 
massive vector and scalar fields decay exponentially away from the monopole center, 
and far from the center the field coincides with that of the Dirac monopole. The 
soliton we discuss is known as the ’t Hooft-Polyakov monopole. 

It is useful to note that there is also an antimonopole solution. It has the different 
behavior of the scalar field, 


O = —vn"(1 = J) 


whereas the field A% is the same as for monopole. The magnetic field (B?¢"/v) has 
the sign opposite to that in (12.12). 

For my ~ mp, the monopole mass can be estimated on dimensional grounds. 
One rewrites the integral (12.7) by changing the variables 


—1 a a a a 
= (gv) Ç; Ay, = vA; (o) = vo. 
Then the energy (12.7) becomes 


p= 


=x feel; FAFA + IDin; — 


where Diy? = ip” + e Ay, etc. The monopole configuration minimizes the 
energy functional, and for my ~ Mp the integrand does not contain large or small 
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parameters. Hence, the monopole energy (mass) is estimated as 


where the factor 47 corresponds to angular integration. Thus, the monopole mass 
exceeds the energy scale v of symmetry breaking SU (2) — U (1). 

The existence of magnetic monopoles is a general property of Grand Unified 
Theories, GUTs. In GUT context, monopoles have masses of order m,, ~ 10!” GeV. 
They are produced at the GUT phase transition which occurs at Te ~ 1018 GeV. Of 
course, this is true under the assumption that so high temperatures indeed existed 
in the Universe. 


Let us describe the necessary (and very often sufficient) condition for the existence of 
magnetic monopoles in a general gauge theory with the Englert-Brout—Higgs mechanism. 
Let the Lagrangian be invariant under gauge group G and the ground state be invariant 
under subgroup H of the group G. In other words, the symmetry breaking pattern is 
G — H. The manifold M of vacua is then the coset space G/H (here we assume that G acts 
on M transitively, i.e., all vacua are related to each other by symmetry transformations; 
this assumption is not, in fact, completely necessary). Stable monopole configurations are 
possible if the vacuum manifold M contains non-contractible 2-spheres, i.e., if its second 
homotopy group is non-trivial, 


m2(G/H) #0. (12.13) 


If the gauge group G is simple or semi-simple (gauge groups are always compact), and H 
includes one factor U(1), then 
mo(G/H) = m(H) =Z, 

so monopoles exist. In the case of the Standard Model, the gauge group G = SU(3) x 
SU(2) x U(1) is not semi-simple, the unbroken subgroup is H = SU(3) x U(1) and 
m2(G/H) = 0; thus, there are no monopoles. The situation is entirely different in GUTs. 
There, the gauge group is, as a rule, simple (sometimes semi-simple), and the symmetry 
gets broken at high energies down to SU (3)e x SU(2)w x U(1)y, and at lower energies to 


H = SU(3)- x U(1)em (an example is the SU(5)-theory, discussed in Sec. 11.2.2). Hence, 
m2(G/H) = Z, and monopoles always exist. 


12.2.2. Kibble mechanism 


Let us make use of the Georgi-Glashow model and the ’t Hooft—Polyakov monopole 
to discuss the Kibble mechanism. The Englert-Brout—Higgs expectation value van- 
ishes at high temperatures, the system is in symmetric phase and monopoles do 
not exist. The Englert-Brout—Higgs expectation value becomes nonzero as a result 
of the phase transition. Just after the phase transition, the directions of the scalar 
field are uncorrelated at distances exceeding the correlation length leor. As a result, 
there are regions in the Universe with the scalar field shown in Fig. 12.2(a), as well 
as regions where the scalar field configurations show the patterns of Figs. 12.2(b) 
and 12.2(c). The configuration of Fig. 12.2(a) is topologically trivial, and in the 
course of evolution it relaxes to a state without a monopole. On the other hand, 
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Fig. 12.2. Possible Englert-Brout—Higgs field configurations just after the phase transition. Circle 
radii are of order leor. 


the configuration of Fig. 12.2(b) has hedgehog topology, cf. Fig. 12.1; it relaxes to 
a monopole. The configuration of Fig. 12.2(c) becomes an antimonopole in the end. 
The probabilities of all three types of configurations are roughly the same (and 
precisely the same for configurations of Figs. 12.2(b) and 12.2(c)), so the number 
densities of monopoles and antimonopoles just after the phase transition are esti- 
mated as ny = Nm ~ ia The Englert—Brout—Higgs condensate is definitely formed 
in an independent way at separation exceeding the cosmological horizon, so that 
leor L£ H~1+(T.), as we noticed in Sec. 12.1. Hence, the abundance of monopoles 
produced by the Kibble mechanism is indeed estimated as in (12.3). 


12.2.3. Residual abundance: the monopole problem 


Monopoles are non-relativistic at temperatures below the phase transition in which 
they are produced. If the monopole—antimonopole annihilation were negligible, the 
ratio of their number density to entropy density would remain constant. Hence, the 
mass density of monopoles today would be equal to 


Pm,0 = MmnmM,0 = MmT) 


f T 3 g 
Ee S oe D dVn 12.14 
1016 GeV \ 1016 GeV cg? Vet 


where we made use of the expression (5.28) for entropy density and estimated the 
monopole number density according to (12.3). The scale 10!° GeV is typical for 
GUTs, so we see that the mass density of monopoles would exceed the critical 
density pe ~ 1075 GeV cm~? by 17 orders of magnitude. As we will see momentarily, 
monopole—antimonopole annihilation does not change the result qualitatively: there 
remain too many monopoles. This is the essence of the monopole problem [229, 230], 
as we should either suppose that the temperature in the Universe was never as high 
as Te ~ 1016 GeV, or give up the idea of Grand Unification. 

Let us now take into account monopole—antimonopole annihilation. First, let 
us check that monopoles are in kinetic equilibrium with plasma. A non-relativistic 
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monopole in plasma experiences effective friction force due to the electromagnetic 
interactions with plasma particles. This force is estimated as f ~ n(T)- o- Ap, 
where n(T) is the number density of charged particles in plasma, o ~ ag?,/T? is 
the interaction cross-section and Ap ~ Tv,, is the momentum that monopole of 
velocity vj, looses in each interaction. So, we have an estimate 


2 
Ff = —-KT UM; 


where k ~ ag?,gx ~ gx. We now write the second Newton law neglecting the 
cosmological expansion, 


We see that the monopole velocity changes significantly in the time interval 
tu ~ My /(kT?). This time is always smaller than the Hubble time M},/T?, so 
the monopoles are indeed in kinetic equilibrium with plasma. They have thermal 
velocity of order vm ~ vr = \/T/my,,, and mean free path is 


1 Mar 


acy T` 


ly ~UT'tu = 


The mean free path grows as temperature decreases. 

The efficient monopole—antimonopole annihilation occurs at high temperatures 
when the mean free path is short. The monopole and antimonopole attract each 
other, while interactions with plasma damp their velocities. As a result, they form 
monopolonium, the monopole—antimonopole bound state. Monopolonium then anni- 
hilates into conventional particles. Unlike direct annihilation, which has very small 
cross-section, this two-stage process reduces the monopole abundance significantly. 

The bound state can be formed when the electromagnetic interaction energy 
of monopole and antimonopole exceeds temperature, i.e., at r < ro = g?,/T. This 
determines the cross-section of monopolonium production, which subsequently leads 
to annihilation, 


2 4 2 
Cann ~ 13 = g8,/T*. 


This estimate is valid at high temperatures, when lw is small compared to rọ, 
otherwise monopolonium is not formed, and annihilation practically does not occur. 

The monopole abundance at high temperatures is such that the monopole mean 
free time with respect to annihilation is of order of the Hubble time, 


T? 
Mp, 


~ 


OannnmUM 


We recall that monopoles have thermal velocities and obtain 


" 
Nat 4 Mñ wd 


aE gg 12.15 
aa aT ( ) 
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where we used s ~ gT’. We see that annihilation dilutes nw /s as the Universe 
expands. This regime, however, terminates at lu ~ ro, and monopole abundance 
freezes out. The latter relation gives freeze-out temperature, 


M m 
Tps a: 
Jm” 


The monopole abundance at freeze-out is then found from (12.15), 


N 1 M 


~~. (12.16) 
s Kg, Migs 

This ratio stays constant until today; using g2, ~ a~! ~ 100, & ~ gx ~ 100 we get 
the present mass density 


2 
Pm,0 = my, ‘SO™ 10". (ea) GeVcm~?. 
This is much smaller than the estimate (12.14), but still way too large for GUT 
monopoles with m,, ~ 10!” GeV. 
In retrospect, monopole problem was a strong reason for not extrapolating the 
hot Big Bang theory up to temperature 1016 GeV. This was (and still is) one of the 
arguments in favor of inflation. 


Problem 12.3. Show that at T < Ty the monopole—antimonopole annihilation 
practically does not change the ratio (12.16). Hint: Assume that the effect of plasma 
on annihilation process is negligible. Take into account Coulomb enhancement of 
the annthilation cross-section. 


Problem 12.4. Find the temperature in an unrealistic Universe where monopoles 
of mass my, ~ 10!" GeV were produced at GUT phase transition at T ~ 10'° GeV, 
at the time when the Hubble parameter takes the same value as in our present 
Universe. 


12.3. Cosmic Strings 


12.3.1. String solutions 


The minimal model admitting cosmic strings — dimension-1 topological defects — 
is the Abelian Higgs model.’ Its Lagrangian is 


1 a= 
L= DY§"Dyb — Fw F” — > (es 7 >) 


Dud = Ono — iedh, Fav = Op Ay — Op Ap, 


(12.17) 


4Models with global Abelian symmetry group may admit global strings. 
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where @ is complex scalar field, A, is the gauge field of U(1) gauge group. The 
Lagrangian (12.17) is invariant under U(1) gauge transformations 


N , 1 
dade, gga“), Ay > Au + znala). 


The U(1) symmetry is spontaneously broken in the theory (12.17): the vacua are 
obtained by minimizing the scalar potential and obey 


. v 
(9°$) = >- (12.18) 
Neither of this vacua is invariant under U(1); the symmetry G = U(1) is broken 
completely (G — H, H = I). According to (12.18) the vacuum manifold is a circle 
S1 (This is in accordance with the relation G/H = U(1), as U(1) is itself a circle.), 


e, ae [0, 27). 


The spectrum is obtained, as usual, by writing the Lagrangian (12.17) as Taylor 
series about the vacuum values of the fields ¢°*° = (@), Aj*° = 0 and using the 
gauge Imọ = 0, i.e., ¢ = v/V2+ h with real h. One finds that all fields become 
massive as a result of spontaneous symmetry breaking; the model contains real 
vector and scalar fieds with masses 


My =ev and Mg = v2), 


respectively. 

In the hot Universe at T >> v symmetry is restored, and both real and imagi- 
nary components of the complex field ¢ take random values at different points in 
space. The random phase a(x), defined formally as arctan[Re¢/Imd], is distributed 
homogeneously. At critical temperature T,, the scalar field acquires nonzero value, 
and the values of the phase a(x) get fixed. Generally speaking, they are different at 
different points. Due to the gradient term in the energy, the configuration becomes 
more homogeneous in the course of evolution. The phases a(x) may become equal 
over the horizon scale in the end, but this is not necessarily the case: the ini- 
tial state may be such that there is a non-trivial winding around some contour C. 
Single-valuedness of the field requires that the phase obeys 


Aa = f da ag =27N, (12.19) 
c d0 

where @ is the asimuthal angle in physical space and N is an integer. Configurations 
with N # 0 become strings in the end, and the mechanism of their production which 
we just described is nothing but the Kibble mechanism. Cosmic string may again 
be illustrated by Fig. 12.1, where the plane of the plot is now the plane orthogonal 
to the string, and arrows show the directions of vector (Red, Im@) in internal space. 
With this clarification, the Kibble mechanism is again illustrated by Fig. 12.2; it 
leads to the production of about one piece of string of length leor per volume [3 


cor* 
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Continuity of the field ¢ guarantees that strings must be either closed or infinite. 
The latter are open strings stretching across the entire horizon.” The contituity of 
requires also that ¢ vanishes along a line surrounded by the contour of integration C 
in (12.19). This means that the string configuration has large energy density there, 
E ~ Mt. 

At large distance from the center, the string configuration minimizes the scalar 
potential (12.18), but has non-trivial angular dependence of ¢ and A,,. In the case 
of infinite straight string stretching along z-axis, the asymptotics in the plane (x, y) 
are (modulo irrelevant constant phase) 


' N 
$ —> ao A, > T Opb. (12.20) 


The winding number N is a topological invariant. The behavior of the field A, in 
(12.20) is such that asymptotically, i.e., at x?-+y? — 00, there are no physical fields, 


Die 0, Fu 9, 


and string energy per unit length is finite. It follows from (12.20) that there is 
magnetic field B = V x A inside the string core. Its flux is given by 


[Bas = f Aoo = BO 
e 


We see that it is quantized in units of 27/e. 

These defects are called Abrikosov vortices in condensed matter theory; these 
are magnetic flux tubes in superconductors. In particle theory, these objects are 
known as Abrikosov—Nielsen—Olesen strings [231, 232], or simply cosmic strings. 


In gauge-Higgs theories of general type, the existence of cosmic strings is possible provided 
the vacuum manifold M is not simply connected, i.e., it contains non-contractible loops. 
The necessary (and often sufficient) condition for the existence of cosmic strings is thus 
non-trivial first homotopy group; in notations used in (12.13), 


m™(G/H) £0. (12.21) 


Rotationally symmetric string Ansatz is 


v ; N e327 
= —|1 — eN? Ai =—-———[1-a(p)], i,j =1,2, 12.22 
$ 7 F(p)] ae l-al), ij (12.22) 
where p = \/(x!)? + (x?)? is the radial coordinate in the plane (x,y). Here, the 


rotation of the plane can be undone by the phase rotation of ¢. Large-p asymptotics 


5For this reason it is hard to “get rid” of strings, if they exist in the Universe. 
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should coincide with (12.20), which means 
f(p > 0%) +0, alp > 00) > 0. 
The fields must be non-singular at the origin, so we have the following behavior, 
fp 0) 41, a(p+0) 51. 


Analytical solutions of the field equations are unknown, but the solutions are 
straightforward to find numerically by making use of the Ansatz (12.22). 

Let us estimate the string energy per unit length, i.e., the string tension. In 
analogy to the estimate of the monopole mass, we write 


_ dE P D tale EAN 
p= J teps Dio+ 1r} +a (0 = (12.23a) 


1 À i 
2 2 * 2 * 
v fe E Ds Dip + gii te a2 (v p- 5) | (12.23b) 


where the change of variables is 6 = vy, x = (ev)~'€, A; = vA;. This gives the 
estimate for the tension, 


enw me’, (12.24) 


for N ~ 1 and à ~ e. We also see that the string thickness ls, i.e., the radius of 
the region where energy density is considerable, is estimated as € ~ 1, which gives 
ls ~ (ev)~!. The tension can be calculated exactly in the special case My = My 
with the result 


p= Tv. (12.25) 
This supports the estimate (12.24). 


Problem 12.5. Express the string tension in kg/cm for v ~ 10! GeV. Compare 
the mass of the Earth with the mass of a string encircling the Earth along equator. 


The fact that N is a conserved topological number does not tell what is the 
minimum energy configuration for given N at |N| > 1: this can be one string of 
winding number N or N strings of unit winding number. It turns out that the result 
depends on the parameters of the theory. For My < Mọ strings with |N| > 1 decay 
into |N| = 1 strings; conversely, for My > Mọ energy decreases if strings merge 
(these two cases correspond to type II and type I superconductors in condensed 
matter theory). The production of strings with |N| > 1 is suppressed in the early 
Universe; the phase transition creates predominantly strings with |N| = 1. 
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To understand how cosmic strings affect the space-time geometry, let us find 
energy-momentum tensor of a string. The general formula for the theory (12.17) is 


Tuv = —Loyw + 2Du¢* Did — FurFipg’”. 
We immediately find that 
Too = —T., =al, 


Rotational symmetry in the (x, y) plane dictates the form of (ij)-components: 


1 
Ti = TO (p)di; + T (p) (rins — stu) (12.26) 
where 
Ti 
ni = — 
p 


is unit radius-vector on the plane (x,y). Note that the configuration (12.22), and, 
consequently, energy-momentum tensor are invariant under Lorentz boosts along 
the string. Clearly, this implies that motion of the entire configuration along the 
string direction is unphysical. The notation in (12.26) has to do with the fact that 
the first and second terms in the right-hand side have angular momenum squared 
equal to 0 and 4 = 2?, respectively. 


Problem 12.6. Show that 


1 1 
I? (rons = 55) =4 (rons = stu). 


where L = —10/00 is angular momentum on the plane (x,y). 


In what follows we will be interested in length scales much greater than the 
string thickness, p >> ls ~ (ev)~+. In that case the relevant object is the energy- 
momentum tensor integrated over the string core, 


J p dp Tuv- 


Let us see that the integrated trace part T of the (ij)-components vanishes. We 


have explicitly 
1 1 
TO = -Tpk = —Fin Fin —V 
5 lik = 7 Fin Fiz (¢), 


where V(¢@) is the scalar potential. Therefore, we write 


J p dp T® oc f Pr (Fara = vo): (12.27) 


To see that this integral is equal to zero, we recall that the string configuration is a 
minimum of the energy functional (12.23a), so this functional is stationary against 
all variations of the fields that vanish at infinity. Let us consider a configuration 
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b(a') = (Krt), A;(a?) = «A;(Kx!), where the right-hand sides involve the fields 
of the string configuration, and the parameter « is close to 1. Upon the change of 
variables zê — «zt, the energy functional for the new configuration is 


2 
— [ġ, Ai] =| Pa [Dis Dios TF + ?V(¢)|. 


This expression as function of x must have a minimum at « = 1. This is the case 
only if the integral (12.27) vanishes, which is the desired result. We note that the 
above scaling argument goes back to Derrick and works for wide class of soliton-like 
solutions. 

Thus, seen from far away, the energy-momentum tensor is 


Ty = u: diag(1, 0,0, -1)6(x)6(y) + T), (12.28) 
where the only non-vanishing components of Te are 
1 


and p is string tension estimated according to (12.24). 


Problem 12.7. Obtain the structure of the tensor (12.28) from the arguments based 
on dimensions, localization of energy, rotational symmetry, energy-momentum con- 
servation and invariance under boosts along the string direction. 


We discuss the effects due to the first term in (12.28) later on, and here we 
show that the second, quadrupole term is, in fact, irrelevant. To this end, let us 
study linearized metric perturbations h,,, about Minkowski metric. Let us choose 
harmonic gauge, 

1 
Ihi — sun, =0. (12.30) 


Then linearized Einstein equations are (see Sec. A.9) 


1 
hyv = —167G G = Imet), (12.31) 


where = 0,0 is D’Alembertian in Minkowski space-time. Since the string 
energy—momentum tensor is static and independent of z, the solution is independent 
of t and z and obeys 


1 
Aohyy = 167G zZ - smeT) (12.32) 


where Ag is 2-dimensional Laplacian on a plane (x,y). Let us consider the effect 
of the term (12.29). The metric perturbation induced by this term has the same 
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angular dependence as in (12.29), 


1a ( AMP) 1 Phe? 1 4 1 
Ash?) SS ae ( 2 te TLZ (ae + ERO! — Zn) (rans = 5%) E 


Away from the string Ag rg = 0, and the solution decays as 


A) x = 
p 
(Another solution grows like p? and is unphysical.) We conclude that the quadrupole 
component of the metric induced by T rapidly decays away from the string: on 
dimensional grounds TA ~ Gul?/p?. This component may thus be neglected at 
large distances we consider, p > ls. 
To summarize, for studying the long-distance effects, the string energy- 
momentum tensor can be set equal to 


Ty = u: diag(1, 0,0, —1)6(x)d(y). (12.33) 


The formula (12.33) represents the approximation of infinitely thin string; this 
approximation will be sufficient for our purposes. 

The only non-vanishing components of the energy-momentum tensor (12.33) are 
the energy density Too and pressure along the string T33. They are equal up to sign, 
so we are dealing with a relativistic object which cannot be studied in the Newtonian 
approximation. Still, one expects that the space-time metric is nearly flat far away 
from the string, so that one can use linearized equation for the Newtonian potential 
®. In background Minkowski space the latter is defined by goo = 1 + 2® and obeys 
the equation (see (A.119) and (A.121)) 


1 
A® = 81G (in — T = 4rG(Too + Tii + To + T33). (12.34) 


The Newtonian potential is zero for the source (12.33) since the right-hand side 
of (12.34) vanishes. Hence, gravitational field is absent away from the string core; 
straight strings neither attract nor repel each other or surrounding matter! We will 
discuss the geometry of space in the presence of a string in Sec. 12.3.3. 


We discuss here straight strings. For strings with ripples, the energy-momentum tensor can 
also be treated as localized along a line, provided that one considers distances larger than 
the length scale of ripples. However, in that case the energy density Too and pressure along 
the string T33 no longer coincide up to sign; the effective energy-momentum tensor obeys 
|Too| > |T33|. The energy density Too is somewhat larger than tension of straight string, 
Too > p, since a distant observer sees “more string” per unit length. For similar reason 
T33 < u. It turns out that energy density and pressure are related by Too - |T33| = pu”. Since 
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|Too| A |T33| for strings with ripples, the right-hand side of Eq. (12.34) does not vanish: 
strings with ripples generate static gravitational field. 


As it evolves, cosmic string sweeps a (1 + 1)-dimensional manifold, world sheet. 
The action for thin cosmic string equals the area of the world sheet (just like the 
action for a particle is the length of its world line): 


S= -n | Vres, (12.35) 


Here €° and €' are timelike and spacelike coordinates parameterizing the world 
sheet, X” = X#(€) are space-time coordinates of a point on world sheet, and 
y = det(yYag), where 


Yas = Og X"(E\O8X” (E) guar 


is the induced metric on world sheet. This action is known as Nambu—Goto action; 
it describes the dynamics of cosmic strings everywhere except for their crossing 
points where finiteness of string width is important. 


The Nambu-—Goto action can be obtained as the leading approximation to the action of 
moving curved cosmic string in the Abelian Higgs model (12.17). The idea of the derivation 
is to find approximate expression for the string configuration, insert it into the action 
(12.17) and integrate over coordinates transverse to the string. 

Let ¿° and €' be timelike and spacelike coordinates on the manifold X“(€) where the 
scalar field @ takes zero value. This manifold is identified with the string world sheet. 
Assuming that the string curvature is small compared to its thickness, one approximates 
the string configuration by Eq. (12.22) generalized to nonzero string velocity. The two 
vectors parallel to the world sheet at a given point X“(€) are OX"/0€°, and one can 
choose two tangential spacelike vectors el), a = 1,2, such that they are orthonormal, 
eel) guv = —§°%, and orthogonal to the world sheet, e 0X" /OE° = 0. The coordinates 
x” of each point near the world sheet can now be written as 


2 
z” = s” (C) = X*(E) + Doe En C= ELE), 


where we introduced two new coordinates n°, a = 1,2 which in the case of straight string 
parameterize points on the plane orthogonal to the string. The Jacobian of the coordinate 
transformation from x“ to Ç" is 


a Ox" ðr” 
Zg- det ($2) = ,/—det (ou) SNE (12.36) 


where we write only the terms which are not suppressed by the radius of the string 
curvature. 
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In terms of new coordinates, the string configuration is 


a=2 


P(O =o (n,n?) AEO = Do eA n,n’), 


a=1 


where (s) denotes the configuration of straight string in the plane (71,72). This is the same 
configuration as in (12.22) with n,n? substituted for 21,22. To the leading order in the 
string curvature we have 


Dd" Do x Dpo®* De®, FZ, = FS?, V(b) = V(b). (12.37) 


pv 3 


The Nambu-Goto action is now obtained by substituting the Jacobian (12.36) and expres- 
sions (12.37), (12.17) into the action of the Abelian Higgs model and integrating over the 
transverse coordinates 7°. 


Problem 12.8. Show that there are no corrections to the Nambu-Goto action (12.35) 
which are linear in the ratio of the string thickness to its radius of curvature. 


12.3.2. Gas of cosmic strings 


For a distant observer, a closed cosmic string of radius R looks like a particle of mass 
M, = 27Ru. If dissipation of string energies (say, by gravitational wave emission) 
is negligible, their effect on expansion rate is the same as that of relativistic or 
non-relativistic matter, depending on their velocities. In fact, the dissipation is 
important; we consider the realistic situation in Sec. 12.3.4. 

The properties of the gas of infinite strings are less obvious. In what follows 
we consider non-interacting strings: indeed, their interaction mediated by the fields 
A, and @ weakens exponentially with separation,® while gravitational interaction is 
weak. To find equation of state for gas of cosmic strings, consider first the config- 
uration of N straight strings at rest, which are parallel to z-axis and separated by 
distance L well exceeding the string thickness. Energy-momentum tensor of such a 
configuration is 


TO (x ,y) = u- diag(1, 0,0, — DJ e-a (y — yi). 


In the limit of large number of strings the average tensor becomes 


[Ti (w, y)dedy _ 
f dxdy 


This expression is valid for static configuration. Let now the strings move along the 
x-axis at velocity u. Energy-momentum tensor is obtained from (12.38) by Lorentz 


(TO) = £ - diag(1,0,0,—-1). (12.38) 


6We consider here Abrikosov—Nielsen—Olesen strings; the situation for global string is different. 
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boost,’ 
vy yu 0 0 
(Ta) = E yu Pu 0 0 y= -t 
ue L2 0 0 0700 V1 — u2 
0 0 0 —1 


We are going to average over the boost directions, so we omit terms linear in u. 
The tensor averaged over boosts along x and y is (Recall that the motion along the 
z-axis is unphysical for strings stretched along z-axis.) 


yu? yu? i) 


A H g; 2 
Ta v) = I2 . diag (, ’ 2 ’ 9 
We now repeat the procedure for string configurations parallel to x- and y-axes 
and average over all three directions. The resulting energy-momentum of the gas of 
infinite cosmic strings moving with velocity u in random directions is 
29 2.2 2.2 
= M D ea E o a 1 a 1 
Tis = (Tw) = fz diag C = ET). (1239) 
As expected, in the limit u — 1 the equation of state of the string gas coincides 
with that of radiation, p = p/3. More relevant for the Universe is the equation of 
state for slow strings 


1 
p=-3P (12.40) 


According to the results of Sec. 3.2.4 this gives 
p,p xa (t). (12.41) 


The physical reason for this behavior is that expansion affects distances between 
strings in two transverse directions, rather than in three directions as in the case of 
particles. 


Problem 12.9. Neglecting string collisions and dissipation of their energy (which 
is not in fact a good approximation), estimate the number of strings in the visible 
Universe, if they were produced in the course of phase transition at Te ~ 100 GeV. 
The same for Te ~ 10!° GeV. Hint: Assume that strings were produced by the Kibble 
mechanism, so that just after the phase transition there was one piece of string of 
Hubble length per Hubble volume. 


The dependence (12.41) shows that strings, if exist, could become important 
at later stages of the cosmological expansion. Their energy density behaves in the 
same way as the contribution of spatial curvature to the Friedmann equation, see 
(4.2), and decreases slower than that of radiation or non-relativistic matter. 


? 


TThis transformation accounts not only for the change of the energy-momentum of each string 
but also Lorentz-contraction of the distance between the strings. 
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Problem 12.10. Under conditions of Problem 12.9 and assuming, within the 
Abelian Higgs model (12.17), that strings were produced in the course of phase tran- 
sition at Te ~ v, obtain the bound on the energy scale v by requiring that the string 
gas makes small contribution to the present energy density. 


If the late time expansion of the Universe were dominated by cosmic string gas, 
the scale factor would grow as 


a(t) xt. 


This shows that string domination at present epoch is inconsistent with observa- 
tions: the cosmological expansion accelerates. In fact, CMB and SNe Ia data rule 
out string domination at a high confidence level (see (4.51) and Fig. 4.10); the cos- 
mological data require dark energy instead. It then follows from (12.41) that strings 
never dominated and, if dark energy density is constant or almost constant, they 
will never dominate in the future. 


12.3.3. Deficit angle 


Let us now consider other effects of infinite or very long cosmic strings. We begin 
with studying the spatial geometry in the presence of an infinite string. Even 
though gravitational potential due to cosmic string vanishes away of its core, the 
effect on geometry is non-trivial. To see this, we consider small perturbation hy, 
about Minkowski metric. In harmonic gauge, it obeys Eq. (12.32) where the energy— 
momentum tensor is given by (12.33). The only nonzero components of the combi- 
nation (Ty — nuv TÀ) in the right-hand side of Eq. (12.32) are (11) and (22). Thus, 
the nonzero components of perturbations are hy; and h22, and they both obey 


Azhi1(22) = 167Gyd(x)d(y). 


We conclude that in the harmonic gauge 


2 2 
huv = 4G log (2 = ) - diag(0, 1, 1,0), (12.42) 
0 


where po is a parameter of dimension of length, determined by the string thickness. 
With perturbation included, the metric is 


2 
ds? = dt? — dz? — h — 4Gu log (5) - (d? + p’ dé”), 
0 
where we switched to cylindrical coordinates. The latter expression is valid for 
Gu <land p > po, but 4G log (<) < 1; in this case the perturbation (12.42) is 
0 


small while the details of the string core are irrelevant. It is convenient to perform 
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coordinate transformation to the radial coordinate p, such that 


2 
dp? = (1 — 4G log (5)) dp?. (12.43) 
PO 


To the first order in Gu this gives 
~ P 
p=p- (1 -4Gulog Ż + 46x). 
Po 
and 


2 
P- h — 4Gulog (5) = p+ (1—4Gp). (12.44) 
0 


It follows from (12.43) and (12.44) that the metric of static straight string stretching 
along z-axis can be written as [233] (we omit tilde over p from now on) 


ds? = dt? — dz? — dp” — (1 — 4G u)? p’ db". (12.45) 


Problem 12.11. Find asymptotically flat form of metric for string with ripples 
whose energy-momentum tensor is Ty, = diag(u’,0,0,—p"’)d(x)d(y), Wu” = p’, 
u’ > u”, to the first order in Gu', Gu”. Show that space-time is curved outside the 


string core. 


Metric (12.45) has conical singularity. Circles p = const have length smaller than 
2mp. Metric (12.45) becomes Minkowskian upon the coordinate transformation 


0 — (1 — 4Gu)0. 
However, the polar angle 0 now takes values in a narrower interval 
0 <0 <2r(1— 4G). 
In this regard, one introduces the notion of deficit angle, whose value is 
A9 = 81Gu. (12.46) 


Thus, space-time is locally flat but the geometry on (x, y)-plane is conical.® 

The presence of deficit angle leads to a number of interesting physical phenom- 
ena. One is double image of an object behind the string. To illustrate this pictorially, 
let us use the coordinates in which metric is Minkowskian. Consider an observer 
at distance d from straight cosmic string: this is the point O in Fig. 12.3, while 
the string is at point S; the figure shows tangential plane to the string. The lines 
SA’ and SA” must be identified, and the shaded region cut out; the deficit angle 
is A@. Light rays coming to the observer at equal angles a’ = a” to the direction 
OS towards the string, actually emanate from one and the same point, since the 


2 
8We obtained the expression (12.45) for 4Gy log (4) <1. It is clear, however, that (12.45) is a 
0 


solution to the Einstein equations for larger p as well: the metric is locally flat and its Ricci tensor 
vanishes, Ruy = 0. 
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Fig. 12.3. Light propagation from a point A’ = A” behind the string S to an observer O. 


points A’ and A” are identified. Thus, the observer will see two images of the object 
placed at A’ = A”. The angular distance between the two images, Aa = a’ +a”, 
is proportional to deficit angle. For small A@ one has 


Ad = A ha, (12.47) 


where l and d are the distances from the string to the source and observer, respec- 
tively. While the distance to the source is measurable (say, by determining redshift), 
the distance d to the string is not. Thus, the measurement of the angular distance 
can only give the lower bound on deficit angle, 


A0 > Aa. 


According to (12.46), deficit angle is proportional to the string tension, so a mea- 
surement of Aa would place a bound on the energy scale v of the theory, 


To see this lensing effect in the coordinate frame (12.45) one would cut out 
the shaded region in Fig. 12.3 and glue together its boundaries SA’ and SA”. The 
resulting surface would be a cone with apex at the string position S. Light can 
travel from the source A’ = A” to the observer O along two straight lines, which 
pass left and right of the apex. Hence the two images of the source. 


Problem 12.12. Making use of the geodesic equation in metric (12.45) give alter- 
native derivation of the lensing effect of cosmic string. 


We neglected the cosmological expansion in the above analysis. In general, however, the 
relationship between A0 and Aa depends on cosmological evolution. As an example, the 
analog of the relation (12.47) in flat matter dominated Universe is 


= (14 ao 


Aa = 40. (1-H 
j ( 1- (Fa) 


(12.48) 


where zs and z4 are redshifts of string and source, respectively. 


Problem 12.13. Derive the relation (12.48). Find similar relation for the Universe domi- 
nated by cosmological constant. Generalize that formula, as well as (12.48) to string inclined 
to the plane of Fig. 12.3 at angle 6. 
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We have considered point-like source for simplicity; in that case the observational 
proof that two images are due to single source would be the identity of the two 
spectra. The situation with extended source is more complicated, but in that case 
too, there exists a region behind the string such that sources in it produce two 
identical images. 


Problem 12.14. Find the shapes of images of a spherical object behind the string 
in a general case. 


Another phenomenon due to cosmic string is specific distortion of CMB 
anisotropy. If a string and an observer are at rest with respect to CMB, then lensing 
by the string leads to repetitions in the pattern of anisotropy. The source is now 
the surface of last scattering, and the distance to it much exceeds the distance to 
the string. Therefore, the angular distance between identical temperature spots is 
equal to deficit angle, A0 = Aa. 

If the string moves in the direction transverse to the line of sight, there is a new 
effect of systematic shift between frequencies (and hence temperature) of photons 
passing the string on different sides [237]. This effect is illustrated in Fig. 12.4, 
where the coordinate frame and notations are the same as in Fig. 12.3. Let us 
again consider two images A’ and A” of one and the same source. Let the string S 
move with velocity u, in direction normal to the line of sight. Then all points on 
the line A'S A” move with the same velocity. The source A’ has velocity u} whose 
projection on the line OA’ points in the same direction as the photon momentum. 
For the source A” this projection is opposite to photon momentum. Hence, the 
longitudinal Doppler effect leads to shifts of photon frequencies 


A0 A 
OA’: Aw = UL OA": Aw = -u 2, 


where y = 1/,/1— u? and we have set Aa ~ A9, having in mind CMB photons. 
We see that photons crossing the string trajectory in front and behind the string get 
redshifted and blueshifted, respectively. This leads to additional CMB anisotropy. 


Fig. 12.4. Light propagation from source A’ = A” behind the string S moving with velocity u_. 
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Fig. 12.5. Motion of a string S in dust-like medium. 


For small A@ and realistic u; the effect is small, 


ôT u 
m = WAG =17- 0 i Goan” 
Yet the analysis of CMB angular spectrum together with data on large scale struc- 
ture rule out string networks [238] with u = (0.6 - 1016 GeV)? and single string 
stretching through the visible Universe with u > (1.0 -1016 GeV)? [261]. 

It is known from observations that CMB temperature fluctuations are random 
and Gaussian. The small jump of temperature across the string projection onto 
celestial sphere is a non-Gaussian feature. Non-Gaussianity of this sort has not 
been observed, which also leads to a bound on p [239]. This bound is of the same 
order as other bounds, u < (0.7 - 1016 GeV)? for an infinite straight string moving 
with velocity u; = 1/v2. 

Yet another effect due to cosmic string, moving now in dust-like medium, is 
the formation of overdense region (sheet-like wake) behind the string [240]. This is 
illustrated in Fig. 12.5, where the coordinate frame is the same as in Fig. 12.3. The 
string is normal to the plane of the figure. Let there be two dust particles A; and 
Ag in this plane. Let these particles be at rest and at distance r/2 from the string 
trajectory OS. The string moves with velocity u. After the string passes between 
the particles, the latter move towards each other and meet behind the string. (The 
meeting point is Ai = A} in Fig. 12.5; recall that the lines SA) and SA‘ are 
identified.) 

Let us calculate the velocities of the dust particles in direction towards each 
other. For particle A, this direction is orthogonal to the line SA‘. In the string rest 
frame this particle moves along the line A; A4 with velocity u. Hence, its velocity 
at point A‘ in the direction to particle Ag is 


A9 
vy = u: sin DES 4rG uu, (12.49) 
where we set Gu < 1. Taking, as an example, u ~ 0.1, we find numerically 
Avy| = 0.8 -1078 . — E. 
[Avy| = 0.8 10°" - GeV 


We see that this velocity may be fairly high. 
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12.3.4. Strings in the Universe 


Relatively large velocities of particles in wakes of moving cosmic strings lead to 
formation of overdense regions there. These overdensities may in principle serve as 
seeds for formation of structures — galaxies, clusters of galaxies, etc. The structures 
formed in this way would stretch along the string trajectories and hence they would 
be 2-dimensional. We note here that galaxy distribution indeed shows that there 
are 2-dimensional and also 1-dimensional structures, walls and filaments. However, 
detailed analysis reveals that only small fraction of matter gets into the wakes [241], 
so that observed voids (galaxy-poor regions) do not form. 

Another structure formation mechanism in models with cosmic strings is accre- 
tion of non-relativistic matter onto string loops [242, 243]. The resulting density 
perturbation spectrum is almost scale invariant, in accordance with observations. 
However, the string mechanism of structure formation predicts CMB anisotropy 
angular spectrum in gross contradiction to observations [244]; see Fig. 12.6. Hence, 
string mechanism of structure formation cannot be dominant. The bounds on the 
string contribution into CMB anisotropy, and hence into structure formation, are 
at the level of 3%. Even stronger limits come from the search for B-mode of CMB 
polarization, since strings lead to the generation of tensor and, most notably, vector 
perturbations in metric and matter which induce the B-mode. (See the accompa- 
nyng book for the discussion of CMB polarization.) 

Many properties of dynamics of cosmic strings cannot be found analytically, so 
one relies upon numerical simulations. These show, in particular, that just after the 
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Fig. 12.6. CMB temperature angular spectra in ACDM model (solid line with peaks) and pre- 
dictions in in models with density perturbations generated by cosmic strings and textures [244]. 
(See Sec. 12.5 for discussion of textures.) The absence of peaks in the latter models is due to the 
fact that perturbations are never superhorizon and are are generated during fairly long period of 
time. For this reason tha phases of oscillations in (1.23) are random. 
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phase transition the mass fraction of infinite strings is four times larger than that 
of closed strings. The initial velocities of long strings are small, but they increase 
in time and on average become rather large, u ~ 0.15. Numerical simulations have 
shown also that the energy density of cosmic strings decreases in time, so that it 
never dominates the cosmological expansion. Hence, unlike monopoles and domain 
walls, models with strings produced in phase transitions are still cosmologically 
allowed. 

There are two main processes determining the evolution of cosmic strings in the 
Universe. One is string intersections and self-intersections, which lead to production 
of loops out of long strings. The second is gravitational wave emission by relatively 
short strings, which leads to disappearance of small loops. Let us first discuss the 
latter process. The power of gravitational wave emission is determined by the third 
time derivative of the quadrupole moment [83] Q of a string loop, 


1 (BQ? 
Pow ~ aaa \ as) > 
Mz, \ dt 
The quadrupole moment of a loop of radius R is estimated as Q ~ uR?. The string 
equation of motion which follows from the action (12.35) shows that loops rotate 
and oscillate at velocities of order 1, i.e., dR/dt ~ 1. This gives 
2 
u 
Pow = Cow => 
g g M2, 
where the constant Cy, is obtained by numerical simulations and turns out to be 
rather large, Cyw ~ 10? (see, e.g., [234]). Thus, loop looses its energy at the time 
scale 
HR MpR 


tow © , 12.50 
g Pag Cpt ( ) 


As a result, the size of the loop becomes of order of its thickness, R ~ v~', and the 
string decays into high energy particles.? 

Let us now turn to string intersections. Numerical analysis shows that inter- 
section almost always leads to reconnection [235], so self-intersections give rise to 
decays into shorter loops. Somewhat unexpected result of numerical simulations of 
cosmic string dynamics in the Universe is that the evolution of the cosmic string 
energy density soon after the phase transition coincides with the evolution of the 
total energy density, p x t~? [236]. At any given moment of time each Hubble vol- 
ume contains a dozen long strings stretching behind the horizon, many long closed 
strings and numerous small loops. This behavior is due to efficient production of 
closed strings and their subsequent decay via gravitational wave emission. 


°We note that this is one of the possible mechanisms of generation of ultra-high energy cosmic rays. 
For p = (1018 GeV)?, this mechanism works with string loops of present size of order R ~ 1 Mpc: 
it follows from (12.50) that the lifetime of these strings is of the order of the present age of the 
Universe. 
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To understand this picture at qualitative level, let us estimate the energy density 
of closed loops at radiation domination. The number density of loops of size R, where 
R « H-t, decreases due to the cosmological expansion, 


m(R, t) « a(t)? ~ t’. 


In the scale-invariant regime we obtain on dimensional grounds 


m(R, t) ~ F ( E 


(R)? \ Bt 
where E ~ Ry is the mass of a loop. The energy density is then 
E E 
man dnı(R, t) py\3/2 [7m dE 
i) ~ bap = (£) I 3 12.51 
pil ) [ 4E t Ear E3/2 ( ) 


This integral is saturated at lower limit, hence 


p(t) ~ (4) a= (12.52) 


where Emin is the minimum possible loop energy. We see that the energy density 
is dominated by the lightest but numerous loops. 

The minimum size of loops is determined by the requirement that the lifetime 
(12.50) exceeds the Hubble time, so we get 


2 


t, Emin(t) ~C a t 


H 
Rmin thee wap. 
( ) g M2, 


My 
We finally obtain 


_7 VEMp: 1 


t = 
pit) Jane 


i.e., the energy density of strings indeed follows that of radiation, praa ~ M},/t?. 
The relative contribution of strings remains small, p1/Praa ~ y u/Mẹ Still, it 
is much larger than the relative contribution of infinite strings, which is of order 
u/M?,. This shows that it is small loops rather than long strings that may affect 
the spectrum of density perturbations. 


The behavior m(R,t) x (Rt)~3/? can be seen from the following argument. Let nj(R, t) 
be the number density of loops of size R at time t. Loops of sizes in interval R and R+dR 
have energy density 


pi(R, t)dR = wRni(R,t)dR. 


Energy density of loops decreases due to the cosmological expansion, and at the same time 
more loops are produced by (self)intersections of long strings. Scale-invariant behavior 
means that the loop production is determined by a scale-invariant function which we 
denote by f(R/lx), where ly is the horizon size, the length scale inherent in the Universe. 
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This function parameterizes the energy loss of long strings due to production of loops of 
sizes between R and R + dR as follows, 


nE - F(R/ln). 


Hence, the equation for balance of loops is 


dpı(R, t) 


S + BH (a R,t) = Fe f(R/ln), 


where H(t) = 1/(2t) and ly = 2t. The solution to this equation is 
PR) = eo |, VEED 
= 16 TRAFA Jaje 


For short loops, R/t — 0, this indeed gives the evolution law (12.51). 

All the discussion above assumed that the sizes of loops produced in intersections are 
much larger than the minimum size. Numerical simulations [245] show that the loop sizes 
are indeed comparable to the horizon size; at both radiation and matter domination the 
typical size of a newborn loop is R(t) = at, a ~ 0.1. We note that earlier simulations 
with worse resolution could not determine the value of a but suggested that it was small, 
a < 1. The interpretation was that the typical size of a newborn loop was determined 
by the size of ripples of long strings rather than the horizon size. Were that the case, the 
newborn loops would decay in Hubble time, and would be irrelevant from the viewpoint 
of density perturbations. More sizeable would be density perturbations created by a dozen 
of moving infinite strings. 


Finally, let us estimate the energy of gravitational waves produced by cosmic 
strings. With account of redshift of their frequencies, the energy density in gravity 
waves Pgw obeys the equation 

dpi oVEMPi Mp L 


ea 6 F 


Its solution at radiation domination is 


ipf ad Mp, 1 t 
Paw = al (-#) tdt ~ ya Fl = log (12.53) 
ty V Cow t ty’ 


where tı denotes the time when the first gravity waves are emitted, i.e., the time 
when the cosmic strings begin to decay. The energy density in the Universe at 
radiation domination is (see Sec. 3.2) 

_ 3M3, 

~ 82qt?? 
hence the relative contribution of gravity waves is given by 


Pow 64r ph t 


~ log —. 
p 3Mri (Cw h 
We see that this contribution grows slowly, logarithmically, but it can be quite large. 
Decaying strings may be intense source of relic gravity waves. 
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The effect of gravity wave production enables one to place strong bounds on 
models with cosmic strings. In particular, pulsar timing measurements give the 
bound [246] 


u S (2-10? GeV)?. 


We note that additional sources of gravity waves are singularities on strings, kinks 
and cusps. These singularities emerge both as a result of the evolution of closed 
strings and due to (self-)intersections; see, e.g., Ref. [247]. The singularities have 
velocities close to the speed of light and emit intense pulses of gravity waves. The 
singularities may also be sources of particles of large masses and/or super-high 
energies. The latter aspect is of interest for cosmic ray physics. 


The possibility of production of strings in the Universe is considered also in the context of 
superstring theory, the candidate theory unifying gravity with other forces. In superstring 
theory, there exist fundamental strings and other string-like objects. In simple cases their 
existence in the Universe is ruled out: their tension is in the range u ~ (1 — 107°?) - M2, 
which is inconsistent with CMB data. Furthermore, production of very heavy strings is 
problematic from the viewpoint of inflationary theory. Finally, in simple versions of string 
theory long strings are unstable at cosmological time scale. 

Nevertheless, there are versions of string theory in which strings have interesting and 
phenomenologically acceptable properties; see e.g. [248] and references therein. Besides 
fundamental strings (F-strings), another type of objects, D-strings, is of importance. This 
is a subclass of so called Dp-branes (where p refers to spatial dimensionality of the object). 

Properties of these types of strings are quite different from those of cosmic strings. 
Production mechanisms of F- and D-strings are also different from the Kibble mechanism. 
Unlike cosmic strings, they have small probability to reconnect when intersecting. Fur- 
thermore, intersection of F- and D-strings may lead to formation of hybrid F D-strings 
connecting F- and D-strings. The production of loops at reconnections, and hence energy 
loss, is thus strongly suppressed. In some string theories, F-, D- and F D-strings form sta- 
ble networks of cosmological scale. Tensions of these strings need not be extremely large, 
so such networks are not ruled out. 
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Let us consider the simplest field theory model admitting domain wall solution. 
This is the model of one real self-interacting scalar field ¢ with the Lagrangian 


1 À 
L = 50,60" — rea —v?)?, (12.54) 
This Lagrangian is invariant under change of sign of the field, ¢é —> —¢ (symmetry 


group Z2). As the result of spontaneous symmetry breaking, the field ¢ acquires 
vacuum expectation value 


(9) =+v or ($) =v, 
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i.e., the space of vacua is disconnected and consists of two points. Phase transition 
in the Universe leads to domains of two types: one with positive field’® (p) = +v, 
and another with negative value (¢) = —v. The field configurations interpolating in 
space between different vacua ¢ = +v and ¢@ = —v are known as domain walls. 


The general necessary condition for the existence of domain walls is, in notations used in 
(12.13), 


ro(G/H) # 0. 
This condition means that the vacuum manifold M = G/H consists of several disconnected 


components. 


In the simplest case of static infinite domain wall stretching in flat space along 
the plane z = 0, the profile depends only on z and is the solution to the field 
equation 


ao 

dz? 
with boundary conditions @(z => +00) = +v (kink) or ¢(z => +00) = 7v (antikink). 
These configurations are given by 


Ag? — v?)b = 0, (12.55) 


ae _ z 2_ 2 
kink: ¢(z) = vtanh7, At = YW (12.56) 
antikink: (z) = —v tanh—. (12.57) 


A 
Problem 12.15. Show that kink is indeed the solution to Eq. (12.55). 


The parameter A has the meaning of the domain wall thickness. To see this, let us 
calculate the energy-momentum tensor of the domain wall. The general expression 
for the energy-momentum tensor is 


Ta = ORO = Lr, 


and for the solution (12.56) we obtain 


4 
ee. dv 


= ————diag(1, —1, —1, 0). 12.58 
w = oA g( ) (12.58) 


We see that the energy-momentum tensor does not depend on x and y and is 
nonzero only in the region —A < z < A. Thus, A is indeed the wall thickness. We 
note in passing that domain walls produced in the Universe are not flat, generally 
speaking. Nevertheless, the kink approximation works well unless domain walls are 
strongly curved. 


10The expectation value of ¢ at finite temperature does not coincide with v, but this is unimportant. 
for what follows. 
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The integrals 
2V2 
n= faa = ZVA, and [rita E |aru = -n 


give the surface energy density of the wall and pressure along the wall. These are 
equal up to sign.!! We note that TRY = 0, which means that there is no tension 
across the wall. 

Large spatial components of the energy-momentum tensor imply that domain 
walls are relativistic objects, which may be expected to have non-trivial properties 
once gravity is turned on. To illustrate this, let us make use of the linearized equa- 
tion (12.34) for the Newtonian potential. With the domain wall energy-momentum 


tensor (12.58) we obtain 
A® = —4nGT2". 
Its solution in the thin wall limit, Too = 74(z), reads 
® = —27Gn]z|. (12.59) 


This would give gravitational field of opposite sign as compared to point-like object. 
Hence, if a domain wall could be at rest it would antigravitate: non-relativistic 
particles would be repelled by domain wall. 

In fact, the notion of a static domain wall, as well as usage of the linearized Ein- 
stein equations are not valid. Hence, the picture we just presented is oversimplified. 
We have given it only to illustrate that gravitating domain walls have non-trivial 
properties. 

The Kibble mechanism works for domain walls too. In general, after the phase 
transition the Universe consists of domains of different vacua divided by domain 
walls, which in turn contain closed domains of the opposite vacuum, etc. Domain 
walls make either open or closed non-intersecting surfaces. !? 

In vast majority of particle physics models, the existence of even a single “infi- 
nite” (horizon size) domain wall in the present Universe is ruled out. A simple way 
to see this is to estimate the energy of a domain wall of horizon size, Mpw ~ 7 Hy a 
and compare it to the total energy inside the present horizon, pe Ho 3. The require- 
ment that the wall energy is smaller than the total energy, 


nH’ S peHo”, 
gives the bound on the surface energy density, 


n S peHo ' ~ (10 MeV)?. (12.60) 


11 The latter property is no longer valid for curved domain walls. 

12We consider the model (12.54) whose space of vacua consists of two points only. There can be 
more complicated situations (e.g., there are models with symmetry Zn, n > 2). In those cases, 
there are more types of domains, and hence more types of walls; these walls may intersect. 
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Barring the possibility of extremely small couplings, this means that the energy scale 
of physics responsible for domain wall must be less than 10 MeV, which is very small 
from the viewpoint of extensions of the Standard Model. Reversing the argument, 
extensions of the Standard Model with discrete symmetries face the “domain wall 
problem” [249]: there must be a mechanism that forbids domain wall production in 
the early Universe, or a mechanism that destroys them in the course of evolution. 
This requirement is certainly far from trivial. 


Problem 12.16. Find equation of state for gas of non-interacting domain walls 
moving at non-relativistic velocities. Hint: Make use of approach similar to that of 


Sec. 12.3.2. 


To end this Section we note that there are other cosmological effects of domain 
walls besides their effect on the cosmological expansion. In particular, gravitational 
potential of a domain wall would affect CMB through the integrated Sachs—Wolfe 
effect discussed in detail in accompanying book. The dominant effect [262] comes 
from the variation of the gravitational (12.59) across the wall, which induces the 
change in photon frequency dw/w = A®. As a result, the CMB temperature in the 
direction towards the wall changes by 

a =-27Gyl, 
where | is the distance from the wall to the observer. Generically, this distance is of 
order of the present horizon length, | ~ Ho 1 and by requiring that 6T/T < 1075 
one obtains even stronger bound as compared to (12.60): 


n < (1 MeV)’. 
12.5. *Textures 


Textures are unstable topologically non-trivial field configurations [226, 250]. These 
are also produced in the course of phase transitions, and the role of topology is that 
their production occurs via the Kibble mechanism. Being unstable, textures do not 
survive up to the present epoch, and their presence in the early Universe can be 
detected only indirectly. One possibility is that they contribute to the generation of 
density perturbations that develop into structures in the Universe [251]. We note, 
though, that no effects due to textures have been observed so far. 

Like stable topological defects, textures may exist in theories with spontaneously 
broken global or gauge symmetry. An example of the former type is given by a 
model with global symmetry O(4) broken down to O(3). The model contains four 
real scalar fields y*, a= 1,...,4, and the Lagrangian is 


1 
L= soe Opp — org" =). (12.61) 
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Symmetry is broken at zero temperature, and the vacuum can be chosen as 
(p°) = vdq. (12.62) 


The fields yt, y?, p? are massless in this vacuum; these are Nambu—Goldstone 
bosons!’ corresponding to global symmetry breaking O(4) — O(3). The field h 
defined by h = yt — v acquires the mass mp which we assume to be large enough. 


Problem 12.17. By calculating the quadratic action for perturbations about vac- 
uum (12.62) show that the spectrum of the model is indeed as described above. 


The scalar potential of the model (12.61) vanishes for any fields obeying 
pry? =v. (12.63) 


This equation determines the vacuum manifold; in our case this manifold is a 
3-sphere S3 c The energy density of field configurations of the sizes greater than 
m,,* is not very large provided the relation (12.63) is valid at every point in space. 
These configurations have finite total energy as long as the gradients of y* vanish 
as r — 00, i.e., y*(x) tends to a constant independent of angles and r. Without loss 
of generality we set 


"(r > œ) = vdf. 


From the topological viewpoint this means that our 3-dimensional space effectively 
has topology of sphere 93 


space? 
one point in S3 e The field configurations y“(x) just described define, therefore, 


3. to 3-sphere S3 


space vac? 


ogy. The topological number is the number of times the sphere S3,,.. 


since all points at spatial infinity are mapped into 
mappings from 3-sphere S which may have non-trivial topol- 
is wrapped by 
the mapping Space ` Sac; this number, called degree of mapping, is integer. 

The simplest topologically non-trivial configuration is 


cos ġsin 0 sin x 


sin dsin f sin x (12.64) 


aS) 
II 
e 


cos # sin x 


cos X 
where ¢ and @ are spherical angles in our space, while the function y(r) obeys 
x(r=0)=0, x(r > ow) = 7. (12.65) 


This configuration has unit topological number. This is precisely a texture of mini- 
mum topological number. 


Problem 12.18. Show that the configuration (12.64) has topological number 1. 


13The existence of massless Nambu-—Goldstone bosons is a very general property of models with 
spontaneously broken global symmetries. It may lead to phenomenological problems; we do not 
discuss this aspect here. 
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The configuration (12.64) is unstable. Its energy is entirely due to the gradient 
term, 


1 
E= 5 | esve Vo". (12.66) 


Under rescaling x —> 2’ = az, energy (12.66) scales as Æ — E’ = aE. Hence, this 
configuration is unstable against shrinking;!+ once formed, texture shrinks. When 
its size becomes of order Mm,’ the property (12.63) need not hold any longer, and 
the texture unwinds and disappears. Its energy gets transmitted into energy of 
elementary excitations, Nambu—Goldstone bosons. 


We note in passing that in static 3-dimensional space whose geometry is S®, configurations 
similar to textures may be stable. The solution still has the form (12.64), but now x is the 
third coordinate on gee: Metric in coordinates x, 0, ¢ is 


d = a? [dy + sin? y(d6? + sin? Odd”). (12.67) 


The solution (12.64) is non-trivial on entire Se ce and vector (12.64) coincides with the 
normal to O sacks if the latter is embedded into fictitious 4-dimensional Euclidean space. 


Problem 12.19. Show that the field configuration (12.64) is a solution to the field equa- 
tions in the space with metric (12.67). 


Let us give a heuristic argument in favor of stability of the solution (12.64) in space 
with metric (12.67). Consider the family of field configurations of the following form: 


cos ¢sin 0 sin(y/a) 
sin ¢ sin 6 sin(y/a) 


p=v , p= 


cos ð sin(x/a) 
cos(x/a) 


(12.68) 


=. Oo O QO 


fr 0<x <ra, for tma<yx<T. 


They have topological number 1 and describe shrinking texture as a changes from 1 to 0. 
The energy of the configuration (12.68) is 


a? sin? 
2 2|1 sin 27a 
= 27r*-a-v E +a— Tra | ; (12.69) 


Since both a = 0 and a = 1 are minima of energy, both limiting configurations (i.e., the 
original configuration with a = 1, and shrunk configuration with a = 0) appear classically 
stable; there is potential barrier between the configuration (12.64) and shrunk texture, at 
least along the path in configuration space defined by (12.68). 


14This illustrates the fact that topology alone is not sufficient to ensure the existence of non-trivial 
stable configurations. 
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Textures are created in the course of phase transition by the Kibble mecha- 
nism. As the Universe expands, textures shrink and disappear by emitting Nambu- 
Goldstone bosons. The energy density in textures decreases, while the spatial regions 
where the field takes one and the same value expand and eventually become of the 
horizon size. The latter process is fast, so almost immediately after the phase tran- 
sition, there is about 1 texture of size H~1(T..) per Hubble volume H~°(T,). 

As the Universe expands, the process continues, and there remains about one 
texture of the horizon size per horizon volume. Since the energy density in texture 
is determined by the gradient energy, we have an estimate 


dp ~ (Ve)? ~ vw? H(t) ~ v7t7?. 


The background energy density has the same dependence on time at both radiation 
and matter domination, so textures give perturbations of energy density at horizon 
crossing independent of the spatial scale of the horizon, 


v2 


p 

T = const gz 
Importantly, these perturbations exist in a wide range of spatial scales, starting 
from H-!(T,). Hence, to zeroth approximation density perturbations have scale- 
invariant spectrum [251]. 

Approximate scale invariance is precisely the property of the measured spec- 
trum of density perturbations. However, the texture mechanism of the generation 
of density perturbations would lead to the CMB anisotropy spectrum inconsistent 
with observations; see Fig. 12.6. Thus, textures as the main source of density per- 
turbations are ruled out [244]. 


Problem 12.20. Estimate the number density and energy density of Nambu- 
Goldstone bosons produced in the Universe within the model (12.64). Which values 
of v are ruled out by BBN? What is the fraction of Nambu-Goldstone bosons in the 
energy of relativistic matter today for v = 1016 GeV? 


12.6. *Hybrid Topological Defects 


Some models admit structures consisting of topological defects of different dimen- 

sionality. This may be the case if spontaneous symmetry breaking occurs in two 

or more steps. Each step leads to vacuum rearrangement, possibly new topological 

properties of vacuum manifold and hence new types of defects. From a topology 

viewpoint, a chain of phase transitions 
CS Pets, Ti, >To>---, 

may give rise to vacuum manifolds with non-trivial homotopy groups 


ty, (G/H) #0, mn.(G/H2) #0,--- 
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Examples of hybrid structures are fleece (strings with ends on domain walls), neck- 
laces (strings with magnetic monopoles on them), etc. As an example, necklaces are 
produced in two subsequent phase transitions: 


stepl1: G—G' x U(1), 
step2: @ xU(1)—> H x Zy. 


The first step leads to monopole production, while at the second step these 
monopoles get connected by strings. The necklace case corresponds to N = 2, 
so that each monopole gets connected to two others. 

The evolution of hybrid defects is, generally speaking, different from that of 
topological defects constituting the hybrids. Their detailed discussion is beyond the 
scope of this book; we only point out here that among hybrids, the most interesting 
are necklaces. 


12.7. *Non-topological Solitons: Q-balls 


Besides topological solitons discussed in previous Sections, there exist other localized 
field configurations, whose stability is unrelated to topology. Particularly popular 
are Q-balls which are stable due to the existence of conserved global charge (hence 
the name) and absence of massless particles carrying this charge. 


12.7.1. Two-field model 


A simple model admitting non-topological solitons [252] contains real scalar field y 
and complex scalar field ¢. The Lagrangian is 


L = 8p Oth + 5px0"x — VO) — APIA, (12.70) 
where the potential V(x) has absolute minimum at 
x=v #0, (12.71) 
so that 
Vix=v)=0, Vix =0)=V>0. 
The mass of the field ¢ in vacuum (12.71) is 
fig = hv. 


We assume that the field ôx = x — v is also massive. 
The Lagrangian (12.70) is invariant under global U(1) symmetry 


eed, bo sed", xX. (12.72) 
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x(r) 


dgo(r) 


> 
TO r 


Fig. 12.7. Q-ball configuration: profile of the field y(r) and wave function of ¢-particle ¢o(r). 


Note that vacuum (12.71) with ¢ = 0 is invariant under this symmetry. Due to 
these properties, there exists conserved charge 


g= | d°x(og" — gÀ). (12.73) 


Let us find the state of minimum energy at a given value of Q. One of the states 
of charge Q is a collection of ¢-particles at rest in vacuum (12.71); their number 
equals Q. The energy of this state is equal to Q - mg. The competing Q-ball state 
has the following structure. Inside a region of yet to be determined size ro, the field 
x is zero, while away from this region this field takes vacuum value (12.71); x(r) 
smoothly changes from zero to v near the boundary; see Fig. 12.7. All Q particles 
¢ are sitting at the lowest energy level inside the region of vanishing x. Their mass 
is zero there, and the momentum is of order 1/ro. Hence, the energy of each of the 
¢-particles is b/ro, where b is of order 1, and their total energy is bQ/ro. The total 
Q-ball energy is thus 


Q 


4 
E= 3770 “Vo + Anrg-o +b—, (12.74) 
0 


where the first term is the energy of the field y inside the Q-ball (recall that Vo = 
V(x = 0)), and the second one is energy in the transition region near ro, o being 
the tension of the Q-ball boundary. We will find (see (12.76)) that the Q-ball size 


is large for large Q, so the boundary term can be neglected, and the @-ball energy 
is given by 


4 
E(ro) = 3770 -Vo + b=. (12.75) 
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We now minimize this expression with respect to rg and find the Q-ball radius 
at given Q, 


bQ \ 
= 12. 
o= (2) (12.76) 
so the Q-ball energy (mass) is 
Efro(Q)] = Mo = const - Vo/4Q?/4, (12.77) 


with constant of order 1. We see that @-ball energy grows with Q slower than 
the energy MQ of free ¢-particles in vacuum (12.71). This means that Q-ball is 
indeed the lowest energy state at sufficiently large Q. It is stable against decay into 
o-particles at 


Mo < mQ, (12.78) 


i.e., at Q > Qe where the critical charge is of order 


Qe ~v —- (12.79) 


Note that for the potential V(x) = A- (x? — v?)? the estimate (12.79) gives 


so that the critical charge is large for A ~ h? <1, i.e., in the weak coupling regime 
without fine-tuning of parameters. If the estimate (12.79) gives formally Qe < 1 (say, 
for À < h*), then the above calculation is inadequate for the critical Q-ball, and 
accurate analysis is required for finding Qe. We note also that the above calculation 
is made under two assumptions. First, we assumed that ¢-particles are the lightest 
particles carrying global U(1) charge, otherwise the estimate (12.79) would contain 
the mass of the lightest charged particle instead of mg. Second, it is important that 
g-particles are bosons, so they can all occupy the lowest energy level inside the 
Q-ball. 


Problem 12.21. Let the particles ¢ charged under U(1) be fermions. Are Q-balls 
similar to those studied in the text stable in weakly coupled theories with mg ~ My, 
where my is x-particle mass in vacuum (12.71)? Hint: Take for definiteness V(x) = 
AO? - vA <1. 


Let us give, for later convenience, an alternative but equivalent description of 
Q-balls entirely within classical field theory. Such a description is adequate at large 
Q. In classical field theory, one considers a classical configuration of the fields x 
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and @, instead of studying the state of quantum ¢-particles. For time-independent 
x and time-dependent ¢ the energy is 


E= pax G -b +V- Vb +h? x7 + ivy -Vx + vow): 


To construct the Q-ball, one finds the minimum of this functional at given value of 
classical charge (12.73). As the ansatz for x(x) one again chooses the configuration 
of Fig. 12.7, while for ¢(x, t) one writes 


(x,t) = Ae’? f(x), (12.80) 


where A is yet unknown amplitude and f(x) is normalized by 


[eorex =i, (12.81) 
Making use of the latter condition, one finds the energy and charge 
4 
E =W A3 +A -Es t+ 370 Vo (12.82) 
Q = 2wA?, (12.83) 


where we again neglected the surface term in energy of field y and introduced the 
notation 


= / Px(\V FP + h2x2(r)/f2). 


One chooses the function f(x) in such a way that it minimizes Eş under normaliza- 
tion condition (12.81). Thus, f(x) obeys 
-Af +h Êl =X f, (12.84) 
where À is the Lagrange multiplier. Equation (12.84) coincides with the Schrödinger 
equation in potential h?x? (r); its lowest eigenvalue A determines Ey: 
f= x. 
Thus, f(x) coincides with the wave function of the ground state of ¢-particle in 
Q-ball (fo(r) in Fig. 12.7), and 
b 
= E = — 
À m 


where b is the same coefficient as in (12.74). 
Hence, the energy functional (12.82) has the form 


b 4 

E=w*A? +A. a+ 370 Vo- (12.85) 
0 

It remains to find the minimum of this functional with respect to remaining 

unknowns w, A and ro at given value of charge (12.83). One obtains from (12.83) 
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that w = Q/(2A”). Then minimization of (12.85) with respect to A gives 


_ Qro 
= 
and the frequency is w = b/ro, as one should expect. The energy as a function of 
the only remaining parameter ro is given precisely by Eq. (12.75). Further analysis 
coincides with that given above, so we see that classical field theory approach is 
indeed equivalent to quantum mechanical one. 

Let us discuss the simple mechanism of the cosmological production of Q-balls as 
dark matter candidates. Assume that the field x has zero expectation value at high 
temperatures, and at some critical temperature T, it develops nonzero expectation 
value Xe due to first order phase transition. Let us assume further that the ¢-particle 
mass is large in the new phase!” 


A? 


m¢(Te) = hXe > To. (12.86) 


We assume finally that before the phase transition, the Universe is Q-asymmetric, 
with the asymmetry characterized by 


Ag= 2 a (12.87) 
S Ng + Ng 
(We assume that ¢ is the lightest particle carrying the global U(1) charge.) 

As we discussed in Sec. 10.1, first order phase transition occurs via nucleation 
of a few bubbles of new phase per Hubble volume; these bubbles then expand 
and eventually percolate. As a result, there remain islands of the old phase x = 0. 
These islands are precursors of @-balls, and, indeed, entire Q-charge of the Universe 
concentrates in these islands. The reason is that because of (12.86), ¢-particles 
cannot penetrate from regions with x = 0 to regions with Xx = Xe. Hence, the 
islands of old phase carry large charges Q. To make an estimate, we notice that the 
number of Q-balls right after the phase transition is of the order of the number of 
bubbles of new phase at percolation. We make use of the result (10.19) and write 


ng (Te) 3 Te 
“vv gx * 3? 
s(Te) Mp 


where, in notations of Sec. 10.1 
3/2 
B= mete >. 
Every Q-ball collects the charge from volume of order ngs so the typical @-ball 
charge is 
Q ~ ng" (Te)s(Te)Ag. (12.88) 


15 The latter assumption is in fact non-trivial. According to (10.40), (10.45) it implies, in particular, 
that. self-coupling of ¢ is small, A < h4. 


12.7. *Non-topological Solitons: Q-balls 417 


We make use of Eq. (12.77) and estimate the mass density of Q-balls in the present 
Universe: 


i ig T 3/4 
poo = Mg: ngo ~ WAY B R TE) - 50. (12.89) 
? , Mpi 


Finally, we set v * and T, roughly equal to the energy scale v, recall that so ~ 
3000 cm~? and set gą ~ 100. We get 


3.1079. EA i ^s j — "A GeV 
PQ,0 106 10-10 1 TeV cm?” 


We see that Q-balls are dark matter candidates, provided that the energy scale v 
is not very high: as an example, for Ay ~ 10~'° (which is similar to the baryon 
asymmetry) the required value pQ, = popm ~ 0.25pe ~ 107° GeV/cm? is obtained 
for v in (a few)-10 TeV range. More accurate estimates [263] give for Ag ~ 10710 


v = 1 — 30 TeV. 


The present number density of dark matter Q-balls is 


1 
neo = (108 — 109 cm)?’ 


i.e. the typical distance between Q-balls is of order of thousand kilometers. Mass 
and charge of typical Q-ball are in the range 


Mo =10° -107 g, Q= 10” — 10”, 
and the estimate for Q-ball size is 


An Asso 


ro =2.107!f cm, for Ag = 107". 


3 PCDM 
Note that the latter estimate is practically independent of the parameters of the 
model. 


Problem 12.22. Obtain the latter estimate. 


Problem 12.23. In the scenario considered in the text, ġ-particles and their 
antiparticles from high energy tail of momentum distribution may penetrate the 
new phase even if the condition (12.86) is satisfied. Let us neglect ¢-annihilation 
in the new phase (which is a reasonable approxomation, given the high energy scale 
of the model; see Sec. (9.2)). These particles @ and œ are not sitting in Q-balls, but 
stay in the Universe intact until today. Find the bounds on the scenario from the 
requirement that the present mass density of free particles and ¢ does not exceed 
pcp and thus refine the inequality (12.86). 
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12.7.2. Models with fiat directions 


Somewhat different class of Q-balls [253] exists in models with sufficiently flat scalar 
potentials. The simplest of these models contains single complex scalar field and has 
the Lagrangian 


L = dG auh — V (G*$), (12.90) 


where the potential V (¢*¢ġ) has absolute minimum at the origin; its other properties 
will be specified later on. This Lagrangian is invariant under the U(1) symmetry of 
global phase rotations (12.72), and vacuum ¢ = 0 does not break this symmetry. In 
what follows we set V(0) = 0, so that vacuum has zero energy. 

Field equation in the model (12.90) is 


OV 
OKO, — =0 12.91 
+a pn ( ) 
and the total energy is given by the integral 
E= fias + |0:6|? + V(¢*p)]d?x. (12.92) 
The mass of ¢-particle in vacuum ¢ = 0 is 
32V 
= ,/ ———(0). 12. 
soa) (12.93) 


In analogy with (12.80), let us try to find a Q-ball solution to Eq. (12.91). This 
solution should carry global charge (12.73), so one expects that it oscillates (in 


pl 


internal space) and is spherically symmetric, 
p= et f(r), g" = et f(r). (12.94) 
By substituting this ansatz into Eq. (12.91) we find 


Ëf 2d d 


1 1 
org et), aia 


Formally, this equation coincides with the equation for classical particle of unit mass 
with coordinate f, moving with friction in the potential 
l op 
Valf) = 50S? - 5V(f), 


while the radial coordinate plays the role of time. We find the asymptotics of f(r) 
at large r by requiring that energy (12.90) is finite. The energy functional for the 
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configuration (12.94) has the form 


E= an fp + (Of)? +V(f)]r2dr. (12.96) 
Its finiteness requires that 
f(r> ow) 0. (12.97) 


This means that the solution we search for is localized, as should be the case for 
Q-ball. 
The behavior near r = 0 is found by requiring that “friction force” in Eq. (12.95) 
is finite. This gives 
d 
FG —> 0) ae, e>0. (12.98) 
dr 
The Q-ball profile f(r) corresponds to the “particle” that starts to move at zero 
velocity at initial “time” r = 0 from some point 


fo= f(r =0) (12.99) 


and rolls down along the potential Veg(f) in such a way that the point f = 0 is 
reached in infinite “time”, f(r —> 00) — 0. 

Clearly, the existence of such a solution implies that, first, the effective potential 
Ver(f) has a minimum at f = 0, and second, the particle has positive energy at the 
start of its motion, since it experiences friction force and its energy is zero in the 
end. Hence, the “initial” value of the field must be such that 


V(fo) 
fo 
The region r Ș ro where f(r) is considerably different from zero, 0 < f(r) S fo, is 
the inner region of Q-ball, and ro is the Q-ball size. 
The charge of the Q-ball is given by 


=w lw’. (12.100) 


Q=snw f f’(r)r’dr = Ty iy (12.101) 


where the second equality is obtained by assuming that 
f(r) ~ fo- 0(ro— r). 


This is a good approximation in the thin wall regime, when the size Ar of the region 
where f(r) changes is small compared to the size of Q-ball itself, Ar « ro. We note 
that in the general case, the second equality in (12.101) can be considered as the 
definition of Q-ball size ro. 
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Assuming the thin wall regime, we estimate the Q-ball energy (12.96) as 
AT 37 9 po 
E m Tr B + VU) (12.102) 


where we neglected the contribution of the gradient term (V f)?. The latter comes 
from the Q-ball surface region, and is therefore small for large Q. The Q-ball size is 
obtained by minimizing the energy (12.102) with respect to ro at given value of the 
charge (12.101). To this end, we use (12.101) to express the frequency w through ro 
and insert the result into (12.102). This gives 


An 


ro V (fo) + 


PO _3Q? 
“3 


E 
l6nré l6nre fe’ 


The minimum of this expression with respect to ro is at 


4r 3 Q 1 


= 2fo/V Go) 


The energy of the Q-ball is 


= Zy V (fo), 


while the frequency w reaches the critical value wo given by (12.100). Finally, the 
energy of stable stationary solution must be at minimum with respect to the remain- 
ing parameter, the field value in the center fo. This is only possible if the function 
V(f)/f? has a minimum at finite f = fo 40, 


min ş SP = = Ken 0) #0, fo4#0, fox. (12.103) 
It is this value that the field takes in the soliton center. 

The properties that the potential V(f) has global minimum at f = 0 and at the 
same time that the function V(f)/f? has minimum away from zero are quite non- 
trivial, especially if one recalls that V(f) is actually a function of f? = ¢*¢. (The 
latter follows from invariance under global U(1).) As an example, the conventional 
renormalizable potential V(¢) = m?¢*¢ + A(¢*¢)? does not have these properties. 
The existence of @-balls needs more exotic scalar potentials which are sufficiently 
flat at least somewhere in the field space. We will briefly discuss in what models 
this is indeed the case, and here we proceed by assuming that the potential has the 
required properties. 
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The Q-ball stability condition is still given by (12.78), which we write as 


My ae EA (12.104) 

fo 
Making use of (12.93) we find that the latter condition is yet another non-trivial 
requirement imposed on the scalar potential, 


2V (fo) _ dV (0) 
R SP 
Provided the potential has properties (12.103), (12.105) and has global minimum 
at the origin, there exist stable Q-balls of the type we have described. 
We note that we have performed the analysis for large Q-balls, as we neglected 
the surface contribution to the energy. Detailed analysis shows that some models 
admit rather small Q-balls whose charges are not exceedingly larger than 1. 


(12.105) 


Problem 12.24. Find approximate Q-ball solution (assuming that Q is large) in 
a model with gauge group U (1). Show that the ratio E/Q increases with Q because 
of the Coulomb contribution to the energy. Show that at large charge the Q-ball is 
unstable against emitting charged particles from its surface. We note, though, that 
for sufficiently small gauge coupling, there is a range of Q-ball charges in which 
Q-balls are stable in models with gauge U (1) symmetry. 


Let us now consider the case when the function V(f)/f? does not have a min- 
imum at finite f. If V(f) increases with f slower than f?, then the minimum of 
V(f)/f? is at f — oo and the value at minimum is zero, 


mee [VQ i oe d 

ming | ——| = | —— =). 
l J” q? fro 
This is possible only in models with very flat scalar potentials. In this case our 
previous analysis does not go through. However, in the case of very flat potential, 


(12.106) 


frow: Vif)xf®% a<2, 


the scalar potential can be neglected in Eq. (12.95) and there exists approximate 
solution obeying all above conditions (12.97)—(12.99), 


sin wr 


f(r) = fo 


This soluton is continuous across the Q-ball boundary r = ro provided that the 
soliton size is related to the oscillation frequency, 


-O(r9 — r). (12.107) 


ro = —. (12.108) 


W 


This is in contrast with the case (12.103). The form of solution (12.107) shows 
that the thin wall approximation is not valid, as f(r) varies in the entire region 
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r < ro. Nevertheless, the solution is explicitly known, so we can proceed further. 
By expressing the frequency in terms of the size, we find the charge of the Q-ball, 


Q=4707., (12.109) 
and its energy (12.96) 
4 
E =4Anf2ro + SrsbV (fo), (12.110) 


where the parameter b is of order 1; it is determined by the shape of the scalar 
potential. We now have to minimize the energy with respect to parameters fo and 
ro at given value of the charge (12.109). By expressing ro through fo from (12.109) 
we find energy as function of fo, 


3/2 
E( fo) = 2mfoQ’? + ae V(fo)- (12.111) 
0) 
Its minimum occurs at 
bd a) at 
anl R) T a 


The left-hand side decreases as function of fo for potentials under discussion, so the 
solution fo(Q) to Eq. (12.112) increases with Q. In particular, for V(f) ~ f% we 
have from (12.112) that 

1 


fo(Q) x QF=, 
and the energy (12.111) behaves as 


E(Q) « Q5. 
For a < 2 the exponent here is smaller than 1, so the Q-ball energy is smaller than 


energy of Q particles at least for large Q; the Q-ball is stable. We note that in the 
limit a — 0 (very flat potential at large fields), the energy behavior is 


Rage. (12.113) 


This is analogous to (12.77). 

Q-balls exist in some realistic extensions of the Standard Model, including super- 
symmetric ones [254-257]. Our analysis goes through in these models, except for 
one important point. The analogs of our ¢-particles are usually not the lightest 
particles carrying Q-charge, so the bound (12.104) gets modified. It is the mass of 
the lightest charged particle that enters the right-hand side of the bound (12.104) 
in realistic models. 

The charge Q in realistic theories may be baryon number, lepton number or 
a combination thereof. Flatness of the scalar potential is a natural property of 
supersymmetric theories, which often have flat directions at the tree level. Quan- 
tum corrections lift the flat directions, but the potential remains nearly flat, since 
quantum corrections give weak dependence on the field, V œ log |¢ġ]. 
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In SUSY theories, Q may be the baryon number, and then the field ¢ is a 
combination of squark fields. The parameter m in the generalized formula (12.104) is 
the proton mass in that case. If Q is lepton number, then the field ¢ is a combination 
of sleptons, and m in (12.104) is neutrino mass. In models with potentials of the 
type (12.106), Q-balls are stable on cosmological time scale for large values of their 
charge only. 


Problem 12.25. Estimate the charges of stable B-ball and L-ball (Q being baryon 
and lepton number, respectively) in models with potentials of the type (12.106). For 
numerical estimate consider the following potential at large f, 


V(f) = (1 TeV)* - (f/1 TeV)“, 
and study the cases a = 1 and a —> 0. 


Problem 12.26. Let us consider a model with Q-balls in which scalar particles are 
unstable against decay into massless fermions. Q-balls in this model are unstable, 
but may have long lifetime, since because of Pauli blocking, fermions evaporate from 
the Q-ball surface rather than from its interior. Assuming that the only parameter 
determining the evaporation rate per unit area is the frequency w, find the charge for 
which the Q-ball lifetime exceeds the age of the Universe. Make numerical estimates 
for the same potentials as in the previous problem. 


Properties of large Q-balls are affected by gravity. At very large Q these objects 
are gravitationally unstable and collapse into black holes. 


Problem 12.27. Estimate the critical charge above which Q-balls collapse into 
black holes in models of the types (12.103) and (12.106). Take all dimensionful 
parameters of order 1 TeV for numerical estimates. Compare with the results of 
previous two problems. 


The dominant mechanism of Q-ball production is the decay of flat directions 
of the scalar potential, moduli fields [256, 258]. In the limit rọ — oo our Q-balls 
degenerate into homogeneous scalar condensate filling the entire space. It can be 
formed in much the same way as the condensate of the Affleck—Dine field (see 
Sec. 11.6) but now this condensate carries certain density of U(1) charge. As we 
will see below, the condensate is actually unstable; it decays by producing very 
different charge densities in different places in space. The regions with high charge 
density eventually evolve into Q-balls, while in regions with low charge density the 
charge is carried by free particles. Detailed analysis shows that this mechanism 
indeed efficiently produces Q-balls, so that they carry fairly large fraction of the 
charge initially contained in the condensate. 

Let us consider this mechanism in some detail. We have seen in Sec. 11.6 that 
the asymmetry (here we are talking about asymmetry with respect to global U(1) 
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charge Q) is produced in narrow time interval near the epoch at which the slow roll 
conditions get violated,!® 

V' (dy 

Vis) H*(¢,), (12.114) 


7 


where ġ; is the initial value of the field, and subscript r refers to the end of slow 
roll regime, cf. Sec. 11.6. Let small explicit breaking of U(1) result in Q-asymmetry 


where ng is the Q-charge density. Unlike in the previous Section, the Q-charge 
density is due to the field ¢ itself, as this field evolves as shown in Fig. 11.17. 

Let us first show that the homogeneous scalar condensate carrying @Q-charge is 
unstable against production of inhomogeneities. To this end, we neglect the cosmo- 
logical expansion (we will discuss this point later on) and write the scalar field as 


gla) = fla) t, 


The field equations for f and a are 


ä- Aa + Saf -5Va-VF=0, (12.115) 
F- Af- fà + faat+V'(f) =0. (12.116) 


Let us begin with the situation in which the scalar field rotates along a circle in 
internal space, 


f= const, a=wt, w = const, (12.117) 


and it follows from (12.116) that 
Bice (12.118) 


Note that at t > tp, when slow roll conditions are violated, one has w > H. In the 
end, this justifies our approximation of static Universe. 

Let us consider linear perturbations about the homogeneous condensate 
(12.117). We ask whether there exist real-valued perturbations that exponentially 


16We use here the general relation (4.57) rather than (4.58) valid for power-law potentials. 
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grow in time. Making use of translational invariance we write 
df (x) = df - e™ cos(px), 


(12.119) 
da(x) = da - e** cos(px). 


Equations (12.115) and (12.116) give 
(2Qw) - df + f- (X? +p?) - da =0, 
(A? + p? + V"(f) —w?)- df — (2Afw)- da =0. 
We combine these equations to obtain the equation for eigenvalues at each p?, 
r+? (2p? + V” + 3w?) +p (p? +V” —w?) =0. (12.120) 
It is straightforward to see that all its solutions A? are real. For 
p? +V” -w <0, (12.121) 


one of the roots is positive, à? > 0. This means the instability of the condensate: 
some modes (12.119) do increase exponentially in time. Importantly, these modes 
are not homogeneous in space: all values of À are real at p = 0. 

Making use of (12.115) we find that the condition (12.121) can be satisfied if the 
potential is sufficiently flat, 


Ve, (12.122) 
f 
The latter inequality is valid for power-law potential, V x f“, at a < 2, i.e., when 
the model admits Q-balls. The strongest instability (the largest A?) occurs for 


Ip] ~ w, (12.123) 


and the exponent is A ~ w. This is the final justification of our static Universe 
approximation. 

The development of instability in non-linear regime is difficult, if not impos- 
sible, to study analytically. Still, the very fact that the condition for instability 
(12.122) coincides with the condition for the existence of Q-balls suggests that the 
process ends up by @-ball formation. Numerical simulations support this conclusion 
(259, 260]. The estimate (12.123) shows that Q-charge is collected into Q-ball from 
the region of volume 


|p|? ~ wr. 


Both evolution of the condensate and the development of instability occur right 
after the end of the slow roll regime, so a crude estimate is 


IP] ~ Al~ w ~ H(t), 


where we made use of (12.114) and (12.118). Thus, of order one Q-ball is produced 
per Hubble volume at time tr. 
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Further estimates are quite similar to our estimates in the end of Sec. 12.7.1. 
Let us consider for definiteness almost flat potential at large f, 


f a 
V(f) = v4 (4) negi 
where v is a parameter of dimension of mass. In this case one has the estimate 


(12.113), i.e., the Q-ball mass is of order 
Mo ~ vQ3"*. 


The results of Sec. 12.7.1 are directly translated to our model, and the parameter 
6 is of order 1, since in our case no(t) ~ H~%(t,). We obtain (see (12.89)) 


3/4 
3/4 1/8 [ Tr 


The difference, though, is that the temperature T, is determined by (12.114), so 
that in the limit of small a we have 


M 
Tp ~ v an 
fi 


This introduces additional uncertainty into the estimate of the present Q-ball mass 
density related to the unknown initial value of the field ¢. If we set f; ~ Mp; as in 
Sec. 11.6, then the results of Sec. 12.7.1 with 8 = 1 are literally valid. With appro- 
priate choice of parameters, Q-balls of our model serve as dark matter candidates. 

If Q-charge is baryon number, then Q-balls may play a role in the generation 
of the baryon asymmetry [257]. As an example, B-balls may carry baryon number 


through the epoch when electroweak baryon and lepton number violating interac- 
tions are at work and tend to wash out the asymmetry. 

Of particular interest are B-balls which are unstable at fairly late cosmological 
epoch. Their decays transmit their baryon number into quarks. This is basically 
a version of the Affleck-Dine mechanism. Most intriguing is the situation when 
Q-balls are unstable and decay into baryons and new stable particles, dark matter 
candidates. Such a mechanism may possibly explain approximate equality between 
the present energy densities of baryons and dark matter. As an example, the decay 
of squark condensate produces three neutralino LSP per baryon. If most of baryon 
number initially was in Q-balls, one obtains a simple relation between mass densities 
of baryons and LSP dark matter, 


PB Mp 
2E ae 


. 12.124 
PopM = 3MzgP ( ) 


For realistic LSP masses, mgzsp ~ 10-100 GeV, this is only two or three orders of 
magnitude smaller than the real value. We note in this regard that the neutralino 
density may get diluted by their annihilation near Q-ball surface. Also, the correct 
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ratio ps/Pcpm is obtained in models where LSP is a gravitino of mass of order 
1 GeV. 

To end this Section we note that the class of non-topological solitons is not 
exhausted by Q-balls. This class includes also quark nuggets, cosmic neutrino balls, 
soliton stars and other hypothetical objects. 
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Fig. 13.1. Spatial distribution of galaxies and quasars from the analysis of early observational 
data of SDSS [4]. Green dots mark all galaxies (within a given solid angle) with brightness (appar- 
ent magnitude) exceeding a certain value. Red dots show galaxies of a special type (Large Red 
Galaxies), which are very luminous and form fairly homogeneous population; in the comoving 
frame their spectrum is shifted towards the red wave-band as compared to ordinary galaxies. 
Turquoise and blue points show the positions of ordinary quasars. The value of parameter h is 
about 0.7 (see Sec. 1.2.2). 
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Fig. 13.2. WMAP data [9]: angular anisotropy of CMB temperature, i.e., variation of the tem- 
perature of photons coming from different directions in the sky (shown by color). The average 
temperature and dipole component are subtracted. The observed variation of temperature is at 
the level of ôT ~ 100 uK, i.e., 6T/To ~ 1074—1075. 
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Fig. 13.3. CMB temperature anisotropy measured by various instruments [8]. The theoretical 
curve is the best fit of the ACDM model (see Chapter 4) to the WMAP data; this fit is in 
agreement with other experiments as well. 
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Fig. 13.4. Cluster of galaxies CL0024 + 1654 [22]. Blue color in the left panel illustrates dark 
matter distribution; elongated blue objects in the right panel are multiple images of a galaxy 
behind the cluster. 


Fig. 13.5. Observation of “Bullet cluster” 1E0657-558, two colliding clusters of galaxies [24]. Lines 
show gravitational equipotential surfaces, the bright regions in the right panel are regions of hot 
baryon gas. 
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Fig. 13.6. Regions in the plane of parameters (Qm, w) allowed (for the flat Universe) by observa- 
tions of CMB anisotropy, by large scale structures (BAO) and by SNe Ia data [67]. The intersection 
region corresponds to the combined analysis of all these data. Regions of smaller and larger size 
correspond to 68.3%, 95.4% and 99.7% confidence level, respectively. 
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Fig. 13.7. Experimentally forbidden and cosmologically favored regions [130] in the plane 
(My/2,mo) in mSUGRA model with tan 8 = 10 and tan = 50. (A is chosen to be zero at 
the scale Mcur % 10!6 GeV.) Forbidden regions in each plot: the region left of the thick blue 
dashed line is excluded by the bound on light chargino mass my+ > 104 GeV, the region in the 
lower left corner left of blue dash-dotted line is excluded by the bound on the light slepton mass 
me > 99GeV, the region left of red dash-dotted line is excluded by the bound on the lightest 
Higgs boson mass, the green region in the left part is excluded by the measurement of b — sy 
decay width, the light pink strip is excluded by the measurement of the muon anomalous magnetic 
moment. The brown region on the right is cosmologically unfavored, since it corresponds to charged 
slepton LSP (mostly stau). Neutralino dark matter regions are shown in light blue; the neutralino 
mass density range in the left and right plots are 0.1 < Quh? < 0.3 (conservative estimate) and 
0.094 < Qnh? < 0.129, respectively. In the bulk of the experimentally allowed region, neutralinos 
are overproduced. 
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Fig. 13.8. Left: The same as in upper part of Fig. 13.7 but for larger range of masses. The 
pink upper left region is excluded by the requirement of electroweak symmetry breaking. Right: 
Cosmologically favored regions (0.094 < Qyh? < 0.129) for various values of tan 3 = 5,10,..., 55; 
lower strips correspond to smaller tan 8 [130]. 
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Fig. 13.9. Bounds [136] on the plane (M1/2, mo) in mSUGRA model with m3/2 = 100 GeV and 
two values of tan 8. The region where gravitino is LSP is to the right of the solid black line. The 
green shaded region is ruled out by b — sy decays. The region to the right of the solid red line is 
consistent with BBN. The blue dash-dotted line divides the regions with neutralino NLSP (upper 
parts of figures) and the lightest slepton (lower parts). In the blue shaded region, NLSP, if stable, 
would have the right dark matter mass density (0.094 < Qh? < 0.129). The present gravitino mass 
density coincides with the dark matter mass density at pink dashed lines. The gravitino mass 
density is smaller than observed Qppm below this line (left panel) and between these lines (right 
panel). These reagions are cosmologically allowed in this model. 
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Fig. 13.10. Allowed region of parameter space for oscillations ve +> / obtained from solar neutrino 
experiments and KamLAND [305]. 
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Fig. 13.11. Allowed region of parameter space for oscillations vy, << vr [307]. Lines encircle areas 
allowed by T2K at 68% and 90% C.L. Shaded regions show the results (at 90% C.L.) of MINOS 
and atmospheric neutrino study at Super-K. The results are presented for normal and inverse 
hierarchy of neutrino masses, NH and IH, respectively. Dots show the best fit values for the two 
hierarchies. 
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Fig. 13.12. Allowed regions of parameter space for De < Ďe oscillations determined by Daya Bay 
reactor neutrino experiment [306]. Dashed lines show the results obtained by analyzing solely the 
antineutrino energy spectrum. Horizontal line Am? p corresponds to the measurement of the same 
parameter by MINOS. 
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Fig. 13.13. Hubble diagram for distant objects [75] (2011). Solid curve is the best fit for spatially 
flat ACDM model. 


Color Pages 437 


0.75 — +TE+EE 


68 
MO +lensing 
MM +lensing+BAO 64 
0.60 so 
56 = 
< © 
G 52 
0.45 
48 
44 
0.30 40 
0.30 0.45 0.60 0.75 
Qn 


Fig. 13.14. Allowed region in the space of parameters (Qaz,Qa) from CMB anisotropy data 
(+TE+EE, lensing) and data on baryon acoustic oscilations (BAO), the preferable values of the 
present Hubble parameter Ho are indicated in color [94]. Shown here are parameter regions allowed 
at 68 %-95 % C.L. 
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Fig. 13.15. Limits on the sum of neutrino masses obtained under various assumptions on other 
cosmological parameters and using different cosmological datasets [104]; Ho (left) is the present 
value of the Hubble parameter, Nesp (right) is the effective number of neutrino species. 
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Fig. 13.16. Excluded regions in parameter plane (Mx, px) [162] for interactions independent of 
nucleon spin (left) and dependent on nucleon spin (right). Regions above the curves are excluded at 
90% C.L. CMS: search at Large hadron collider (under assumption of contact interaction X X fi fa, 
see the main text); IceCube and Super-K: search for a signal from dark matter annihilation in 
the Sun; other data: direct search for elastic scattering of cosmic dark matter in low background 
detectors. Shaded region in the central part of right figure corresponds to possible signal announced 
by CDMS experiment. 
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Fig. 13.17. Exclusion regions in the plane (Mx, 0%") [318] for dark matter particles annihilating 
through indicated channels. The light gray line is the cross-section at freeze-out in the early 
Universe. Regions above the lines are ruled out at 90% C.L. AMS-02: search for antiparticles; Fermi 
LAT: search for photons from dark matter annihilation to ut u~ and 7+7~ (upper and lower lines), 
WMAP?7: limit from possible annihilation into charged lepton pair before recombination (upper 
range Ttr, lower range e+e~ ), which would affect CMB. As an example, the uncertainty for the 
channel et+e~ is shown as shadow region. 
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Fig. 13.18. Parameter ranges for various dark metter candidates [164]: their masses Mx and 
elastic scattering cross-sections off nucleon Gint; lpb = 10-36 cm? (conversion cross-sections for 


axion and axino). Red, rose, blue and dark blue show hot, warm, cold and ultracold dark matter, 
respectively. 
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Fig. 13.19. Predictions of MSSM for spin-independent cross-section of elastic scattering of neu- 
tralino off nucleons [165]. Color regions correspond to parameters consistent with accelerator data 
and yielding Qy =% 0.25. The blue region is ruled out by direct search experiment XENON 100. 
The neutralino annihilation cross-section is enhanced in green regions by resonant annihilation 
through Z-boson or one of the Higgs bosons of MSSM, when My ~% Mz/2 or My ® my/2; 
see discussion below. The suppression of neutralino abundance in yellow and magenta regions is 
due to approximate degeneracy of neutralino and some other superpartner leading to efficient co- 
annihillation of the neutralino with this superpartner. Other mechansims that suppress neutralino 
abundance operate in gray regions; they require fine tuning of parameters too. Shown here are 
also the regions of possible signals in DAMA, CoGENT, CRESST and CDMS experiments and 
planned sensitivity of LUX and XENON 1T. 
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Fig. 13.20. Experimentally forbidden and cosmologically favored regions in the plane (Mj/2, mo) 
in mSUGRA model [166] with Ao = mo and p > 0 (left) and CMSSM [167] with Ap = —my/2, 
tang = 30 and pu > 0 (right). Shown here are lines of constant mass of the lightest Higgs boson 
(realistic interval is 125-126 GeV. In the left panel, the elongated blue region is consistent with 
neutralino dark matter and has m 1/2 % 1300 GeV, mo ~ 850 GeV; the region above the blue strip 
and brown region below are cosmologically ruled out (neuralinos are overproduced and LSP is a 
charged superpartner of tau-lepton, respectively); below the brown region LSP is gravitino (see 
disussion in Sec. 9.6.3), the region to the left of solid green line is excluded by the LHC data; gray 
lines show the values of tan 8. In the right panel, the cosmologically allowed region is shown in red, 
in the black region where m1/2 © 1800 GeV, mo © 7000 GeV neutralino abundance coincides with 
that of dark matter; in the green region LSP is the electrically charged superpartner of t-quark. 
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Fig. 13.21. Same as in Fig. 13.20, left, but for model [166] with an additional parameter in the 


Higgs sector as compared to CMSSM. The magenta region in the left upper corner is ruled out 
by the requirement of spontaneous breaking of the electroweak symmetry. The neutralino is dark 


matter particle in the blue region. 
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Fig. 13.22. Experimental constraints on the parameters of models with dark matter axions [170]. 
The inclined line “KSVZ axion” is the prediction of the KSVZ model; the yellow band around it 
shown predictions of the majority of axion models. 
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Fig. 13.23. Electron neutrino survival probability as function of energy [297]. 


Appendix A 


Elements of General Relativity 


A.1. Tensors in Curved Space-Time 


In this Appendix we introduce the basic concepts of General Relativity. Our presen- 
tation does not pretend to be mathematically rigorous and comprehensive; its main 
purpose is to introduce the notions used in the main text and to gather in a single 
place a number of useful relations and formulas. A more systematic treatment of 
General Relativity (GR) is given in the textbooks [83, 264, 265]. Readers, interested 
in a more mathematically rigorous description of differential geometry, are invited 
to consult the book [266]. Conventions and notations used in this book are collected 
in Sec. A.11. 

The main object of study in GR is curved 4-dimensional space (manifold) M 
describing space-time. For better understanding of abstract mathematical notions 
introduced below, it is sometimes helpful to think of this space as embedded into a 
flat enveloping space (ambient manifold) of higher dimension. We emphasize, how- 
ever, that our space-time is embedded nowhere,! and we never rely on the possibility 
of such an embedding. It is worth noting here that all definitions and facts contained 
in this Appendix are straightforwardly generalized to (pseudo-)Riemannian spaces 
of arbitrary dimension. 

The interval (square of the invariant distance) ds? between two nearby points 
in space-time is? 


ds? = gy (x)dx" da”, (A.1) 


where indices u, v take values 0,1,2,3, and metric g,,,(x) can be considered as a 
4x 4 symmetric matrix. Thus, the metric is determined by ten independent func- 
tions of the coordinates g,,,(x), y < v. Hereafter we assume that the metric has the 
signature (+,—,—,—), i.e., that the matrix g,,,(x) has one positive and three nega- 
tive eigenvalues at each point x. Vectors dx" of positive, zero and negative values of 
ds? correspond to time-like, light-like (null) and space-like directions, respectively. 


1We are not discussing here models with extra spatial dimensions. 
2In what follows, unless otherwise stated, summation over repeated indices is implied. 
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The basic principle of GR is that all choices of the local coordinate frame are 
equivalent to each other. It is therefore natural to consider the functions (fields) on 
the manifold M, which transform in a certain way under coordinate transformation, 


gt — r(x”). (A.2) 


The simplest example of such an object is scalar field (x), whose defining property 
is that it transforms as 


g'(x) = (x). 


Hereafter all quantities with prime belong to the new coordinate frame, while quan- 
tities without prime refer to the original one. The above relation shows that the 
value of the field at a given point of the manifold does not change under coordi- 
nate transformation. Another important example of an object, “well” -transforming 
under coordinate transformations, is a contravariant vector: a set of four functions 
A!” (x), transforming in the same way as small increments of coordinates da", i.e., 


O Ox!” 
~ Oat 


The covariant vector is a set of four variables A,,(z) which transform in the same 


A” (x") 


A" (x). (A.3) 


way as the derivatives ao Lë 


Ox" 
Using the transformation laws (A.3) and (A.4), it is easy to obtain the transforma- 


tion law of the contraction A" B,,, 


Al, (a!) 


Ox'* | ôx’ 
A” (2!) Bi (2') = Ae A (2) azr Ba) 


= A” (x) B, (2). (A.5) 


We see that this contraction transforms as a scalar, i.e., its value at each point does 
not depend on the choice of coordinates. 

From the point of view of geometry, contravariant vector A” (x) can be thought of 
as a tangent vector to the surface M, if the latter is embedded into some enveloping 
space. For example, a derivative of a scalar function ¢(a) along the direction defined 
by the tangent vector A“(x) has the form 


dad(x) = A" (x)ð p(z). (A.6) 


Invariance of contraction (A.5) under coordinate transformations shows that covari- 
ant vectors B(x) can be regarded as linear functionals that map, via contraction, 
the tangent space to real numbers. 
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Similarly, we can define tensor with an arbitrary number of upper and lower 
indices. Such an object transforms in the same way as the product of an appropriate 
number of covariant and contravariant vectors. For example, tensor BE \ transforms 
under coordinate transformations as follows, 


orr OxT Ore o 
"Bat Ae” Age 7p(2). 


Directly generalizing the above result for contraction of covariant and contravariant 


Bjala’) 


vectors, it is straightforward to prove that by contracting an upper and a lower 
indices in tensor of arbitrary rank we again obtain tensor. 

From the fact that the interval ds? defines the distance between two points, which 
is independent of the choice of coordinate frame, it follows that metric gay (x) is a 
covariant tensor of the second rank, i.e., it transforms as 


Ox Ox? 
Tv (2) = Dale gy7 90 (2)- (A.7) 


Problem A.1. Prove the transformation law (A.7). 


Another important example of second rank tensor is the Kronecker 6-symbol ð% 
defined in an arbitrary coordinate frame as unit diagonal matrix, 


oy, = diag(1, 1,1,1). 


Let us check that this definition of 67 is consistent with the tensor transforma- 
tion law. If a tensor is equal to ð% in an original coordinate frame, then in a new 
coordinate frame it is equal to 


On On? y de” ðr’ 
On Ox” P Oa Ax!” 


The right-hand side here is again the Kronecker symbol 6/, so that ô} is a ten- 


vy? 


sor indeed. Starting from the metric tensor g,, and the Kronecker tensor ô} one 
can define a new contravariant second rank symmetric tensor g”” by the following 
equality, 


g” gua = OY. (A.8) 
In other words, the matrix g”” is inverse to the matrix guv. 
Problem A.2. Prove that g'” is indeed a tensor. 


Making convolutions with tensors guy and g””, we can define the operations of 
raising and lowering of indices. For example, 


Av = g” Ap, Buw = Gurgu BP 


If A,, and B% are tensors, then A” and B,, are tensors too. 
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Another important object needed to construct the action functional in GR is 
the determinant of the metric tensor, 
g = det guv. 


In order to determine how g transforms under coordinate transformations, we write 
the transformation law (A.7) in the matrix form: 


d'(a’) = Jala) j7. (A.9) 
Hats indicate that all objects in Eq. (A.9) are matrices of size 4 x 4. The symbol J 
denotes the Jacobian matrix corresponding to the change of coordinates (A.2), 
Ox! 
H = Ā— 
Jy ôx”? 


while J7 is the transposed matrix. Equality (A.9) implies the following transforma- 
tion law for g, 


g'(x’) = J°g(z), (A.10) 


where J is the Jacobian determinant of the coordinate transformation (A.2), 


ox” 


It follows from the transformation law (A.10) that the product 
J—gd' x 


defines the invariant 4-volume element. Since the matrix g,, has three negative 
and one positive eigenvalues, the determinant g is negative. Therefore, the quantity 
yzg is real. 

In Minkowski space, in addition to the Kronecker delta, there is one more tensor 
which is invariant under Lorentz transformations. It is the Levi-Civita symbol e#”?”. 
We are reminded that «“”? is totally antisymmetric in its indices and, consequently, 
it is uniquely defined by the condition 


60128 =. 
However, the Levi-Civita symbol is not invariant under arbitrary coordinate trans- 


formations. Indeed, if a tensor is equal to e“”*” in a given coordinate frame, then 
in another frame it is equal to 


HON? — Ox!" Oa!” Ba! Oa!? coy 
Ox® Ox? Ox Ox? 

The transformation law (A.11) shows that the natural generalization of the Levi- 

Civita symbol to the case of arbitrary curvilinear coordinates and to curved space 


© ie se (A.11) 
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is the Levi-Civita tensor? 


1 
Bevre — cle | 
Vg 


This tensor is completely antisymmetric in all its indices and reduces to e#”*? when 
metric guy coincides with the metric of Minkowski space in Cartesian coordinates, 
Nav = diag(1, —1, —1, —1). 


A.2. Covariant Derivative 


In order to build the action invariant under arbitrary coordinate transformations, 
we have to define the covariant differentiation operator V, that converts tensors 
into tensors. For scalar field, it is natural to demand that this operation coincides 
with the usual differentiation, 


VuG(x) = ð olz). (A.12) 


It follows from the definition (A.4), that the derivative V,,¢ is covariant vector. 
One cannot define the covariant derivative of vector field A“(x) in the same way. 
In order to differentiate vector field, one must learn how to subtract tangent vectors 
belonging to different points of the space M. Therefore, we have to define the rule 
of parallel transport of vectors from one point to another. 
Consider the parallel transport of contravariant vector A“ from the point with 
coordinates x“ to the point with coordinates 


W =o pd". 
(See Fig. A.1.) Imposing the natural requirement of linearity (the sum of two vectors 


upon parallel transport becomes the sum of images), we observe that to the leading 
order in the increments of coordinates dz”, the image A” of the vector A“ has the 


AL 


Fig. A.1. Parallel transport of a vector. 


3More accurately, E“”*? is pseudotensor because it transforms with “wrong” sign under the coor- 
dinate transformations which change the spatial orientation (i.e., transformations with J < 0). 
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following general form 


ÄH (2) = AH (x) — TË, (a) A" (x) da. (A.13) 


The quantities I’, entering here are called connection coefficients. To determine 
the transformation law of the connection coefficients under arbitrary coordinate 
transformations, we perform coordinate transformation in both parts of equality 
(A.13) and make use of the fact that the quantities A”, A“ and da” transform 
according to the law (A.3). The left-hand side of (A.13) transforms into 


ara) = ED) pra) = (O y FO aes) are 


Ox” Ox” Ox” Ox* 
OBE) Giga: 9 OG) aX ay 


where we work to the linear order in dx“. Upon coordinate transformation, the 
right-hand side of (A.13) takes the following form, 


AH (a!) — T (2") Al (2! da" = 
(A.15) 


Equating the results of the transformations (A.14) and (A.15), convoluting both 
sides of the resulting equality with a , and comparing the result with the original 
rule of parallel transport (A.13), we obtain the following transformation law for the 


connection coefficients, 


TP (a!) = Ox? Ox? Ög E + Brt Ore , (A.16) 
Ox!” ðr Ogs P Ox? Ox Ox 
The second term on the right-hand side of (A.16) shows that connection is not 
tensor. 
To define the covariant derivative of vector field we transport the vector A# (x) 
to the point = x + dz, subtract the resulting vector from the value of the vector 
field at the point z and write 


A" (z) — Å" (7) = V, A" - dz’. 


Using the parallel transport rule (A.13), we arrive at the following definition of the 
covariant derivative of vector field A“ (x), 


V A” (x) = 0, AM (x) + Th, A(z). (A.17) 


The transformation law of the connection coefficients (A.16) guarantees that V, A” 
is a second rank tensor with one covariant and one contravariant indices. 
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The parallel transport rule of covariant vector B, follows from the fact that the 
contraction A“ B,, is a scalar transported in a trivial way, 


(A"B,)(#) = (A"B,)(2). (A.18) 


The relation (A.18) and parallel transport law of contravariant vector (A.13) give 
the following parallel transport rule for covariant vector B,,, 


B,(@) = B (x) + 1%) By(2)ds*. (A.19) 
Consequently, the covariant derivative has the form 
V B(x) = 0, B(x) — T} Ba (2). (A.20) 


Now that we have defined the covariant derivatives of scalar and vectors of both 
types, it is not difficult to generalize these definitions to tensors of arbitrary rank. 
This is done using the Leibniz rule, 


V,,(AB) = (V,A)B+ AV,B, 


where A and B are two arbitrary tensors whose indices are not explicitly written. 
For example, the covariant derivative of third rank tensor with one upper and two 
lower indices reads 


Va B% = ô Bz ar Toa Bs- ~ er _ T? „BX 


In principle, one could consider manifolds with arbitrary set of connection coef- 
ficients transforming according to the law (A.16). However, GR is based on the 
(pseudo-)Riemannian geometry,* in which additional conditions are imposed on the 
connection coefficients I’,. The first of these conditions is that the operation of par- 
allel transport (or, equivalently, the operation of covariant differentiation) commutes 
with the operation of raising and lowering of indices. This means, in particular, that 


Juv VA” = Valgu A”) 


for an arbitrary vector A”. The Leibniz rule implies that this is possible only if the 
metric tensor g,, is covariantly constant, 


V agv =0. (A.21) 
More explicitly, this condition reads 
Ong» = Thupa + V4 ,,9up- 


Connections satisfying the condition (A.21) are called metric connections (since 
they are consistent with metric). The second condition imposed on the connection 


4Pseudo-Riemannian geometry differs from the Riemannian one by the signature of metric, which 
for the Riemannian geometry is Euclidean. We often do not pay attention to this terminological 
subtlety. 
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coefficients is the requirement that they are symmetric in lower indices, 
A Trà à 
CHT pig =v: (A.22) 

The transformation law of the connection (A.16) implies that CÀ, is a tensor (called 
torsion tensor in the general case). Hence, the validity of (A.22) does not depend 
on the choice of coordinate frame. Manifold equipped with metric and torsionless 
metric connection is precisely what is called Riemannian manifold, and in this case 
the connection coefficients are called Christoffel symbols. 

Equations (A.21) and (A.22) enable us to unambiguously express the Christoffel 
symbols in terms of the metric tensor, 


1 
Tia = z9” (Odor + O9pv — Opgux)- (A.23) 
We always assume in what follows that the equality (A.23) holds.° 


Problem A.3. Consider a 2-dimensional surface % embedded in 3-dimensional 
Euclidean space R?. The space R? induces metric in X : if yt (i = 1,2) are coordi- 
nates on the surface X, then the square of the distance between nearby points on X 
can be written as 


ds? = gisdy' dy’ , 

where the metric gi;(y) is uniquely determined by the requirement that ds be the 
distance in R?. There is an obvious definition of tangent plane at each point on 
the surface X; contravariant vectors, as discussed above, are vectors belonging to 
the tangent plane. Their components A‘(y) in the chosen coordinate frame on X 
can, for instance, be defined by the relation 

og 

Oy’ 


where (y) is a function on the surface 4, and ag is its derivative along the 


daly) = A'(y) 


direction determined by the vector A. Parallel transport of tangent vector along 
the surface X is naturally defined in the following manner (see Fig. A.2): first we 
transport the vector A from point y to point J as a vector in R? (yielding the vector 
A) in Fig. A.2), and then we project it onto the tangent plane at point y. Let the 
surface % be (locally) defined by the equations 


a? = f*(y’,y’), a@=1,2,8, 
where x®“ are coordinates in R?. 


(1) Calculate the components of the metric gi;(y). 


5In geometries more general than Riemannian, objects given by (A.23) are also called Christoffel 
symbols. Another term used for them is Riemannian connection. 
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Fig. A.2. Parallel transport of a tangent vector. 


(2) Calculate the Christoffel symbols Ts (y) on the surface X, which are associated 
with the above operation of parallel transport. 

(3) Show that the properties (A.21) and (A.22) are satisfied, i.e., the geometry on 
the surface X is Riemannian. 

(4) Propose a generalization of the parallel transport of a vector, such that the tor- 
sion tensor (A.22) is nonzero. Demonstrate this property by explicit calculation 
of the connection coefficients. Does the relation (A.21) remain valid? 


Problem A.4. Derive the formula (A.23) from Eqs. (A.21) and (A.22). 


Problem A.5. Check the following properties of the Christoffel symbols and covari- 
ant derivative: 


Tý, = Vln V—g, (A.24) 
v 1 
g Th = Foul g), (A.25) 
1 
V,A" = Tm (A.26) 


for antisymmetric tensor AM” : 
1 
V pAb’ = —— 0, (vV —gA"”), A.27 
H /-9 PA ) ( ) 


for scalar Q : 


1 V 
ViVi = Ta AN 0,9), (A.28) 
where V} = g” Vo. 


The property (A.26) leads to the following generalization of the Gauss formula, 
JOAN gdz = J OANE da = J| Nia, 


where d}, is the element of surface which bounds the integration region. Together 
with the Leibniz rule for the covariant derivatives, this formula allows for integration 
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by parts of invariant integrals. For example 


[Aw Be =gdx =— [OAB yada + surface terms. 


To conclude this Section, we note the following fact. By choosing a suitable 
coordinate frame, we can locally, at a given point, set all Christoffel symbols equal 
to zero; this is fully consistent with the equivalence principle because this enables 
us to switch off locally the gravitational field. In this coordinate frame all covariant 
derivatives coincide with usual ones, and all first derivatives of the metric tensor 
vanish (by virtue of (A.21)). The transformation to such a frame at a chosen point, 
which we place at the origin, is 


1 
gh — g = gh + aria Oaa, (A.29) 


where T”, (0) are the values of the Christoffel symbols in coordinates x at the origin. 
Using the relation (A.16), it is straightforward to see that all Christoffel symbols 
indeed vanish at the origin in the new frame. Note that the key role here is played 
by the symmetry of the Christoffel symbols in the lower indices, formula (A.22). 

Since the transformation (A.29) is identity at the origin, there is still freedom in 
the metric tensor. This freedom can be used to reduce the metric tensor at the origin 
to the Minkowski tensor. To this end, one simply chooses x” = J} x”, where J does 
not depend on coordinates. In matrix notations we then have the relation (A.9). 
The matrix gy can be cast into diagonal form by orthogonal transformation, and 
then converted to the Minkowski tensor by rescaling of coordinates. The resulting 
coordinate frame has thus the properties 


Gur (0) = Mv, TY ,(0) = 0. 


It is called the locally-Lorentz frame. 
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It is seen from formula (A.23) that the Christoffel symbols differ from zero if metric 
non-trivially depends on the coordinates x”. It should be understood, however, that 
the deviation of Nee from zero does not imply that the space is not flat. Since the 
quantities Tiy do not form a tensor, they can be identically equal to zero in one 
coordinate frame and differ from zero in another frame. 


Problem A.6. Find the Christoffel symbols in polar coordinates on 2-dimensional 
plane and in spherical coordinates in 3-dimensional Euclidean space. 


6In fact, the stronger statement is valid: it is possible to make all Christoffel symbols equal to zero 
along any predetermined world line. 
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Fig. A.3. Parallel transport of a vector along different trajectories (12) and (34). 


The quantity which characterizes the geometry of manifold, rather than the choice 
of coordinate frame, is the Riemann tensor (curvature tensor) R’,,. The Riemann 
tensor determines how the commutator of covariant derivatives acts on tensors. For 
example, for an arbitrary contravariant vector A* we have 


Vive’ -= V, Ha". 


omy" 


(A.30) 


Problem A.7. Check that the equality (A.30) indeed defines the tensor Ri. In 
particular, check that all terms with derivatives of Aò, which could appear in the 
left-hand side of this equality, cancel out. 


Explicit expression for the Riemann tensor is 
Riig = ðh, — Ooh, + rt =r Tp (A.31) 


In order to better understand the geometric meaning of the Riemann tensor, let us 
consider parallel transport of vector Aò from the point x with coordinates x“ to 
the point č with coordinates 


Z“ = x“ + dy” + dz", 


where directions of vectors dy“ and dz“ do not coincide (see Fig. A.3). This parallel 
transport can be done in two different ways: (i) one first moves the vector Aà along 
the path 1 to point y with coordinates 


yall ae lh u 
Ty) =T + dy”, 


and then proceeds along the path 2 to point ž; (ii) one does the opposite, namely, 
first performs parallel transport along the path 3 to the point z with coordinates 


Tia) = g" + dz", 


and then moves along the path 4 to point z. Of course, in flat space the result of 
parallel transport does not depend on the choice of path. In the case of curved space 
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this is, generally speaking, incorrect. Using the parallel transport rule (A.13), one 
can see immediately that the result of the transport does not depend on the path 
to the linear order in increments of coordinates. However, in the quadratic order we 
obtain 


JA Jà o pÀ 
A` (12) — A^ (34) = A° Ro, 


dz”"dy”, (A.32) 
where A*(12) and A>(34) are images of the vector A^ under parallel transport along 


paths (12) and (34), respectively. Thus, the tensor R}, , determines the dependence 


of the parallel transport on the path along which it is done. Consequently, the 
Riemann tensor is indeed a non-trivial characteristic of the curvature. 


Problem A.8. Obtain equality (A.32). In particular, check that the second order 
terms in dx”, omitted in the parallel transport law (A.13), do not contribute to the 
difference (A*(12) — A*(34)) to the quadratic order. 


Problem A.9. Using the Christoffel symbols found in Problem A.6, check by 
explicit calculation that all components of the Riemann tensor are zero in polar 
coordinates on plane and in spherical coordinates in 3-dimensional Euclidean space. 


The above analysis is valid, with minimal changes, for covariant vector A,. The 
analog of Eq. (A.30) in that case is 


VpVvAx — ViuVpAx = —Ag Bh iy (A.33) 
The action of the commutator of covariant derivatives 
[Vs Vv) = Vivo g ViVi 


on tensor of arbitrary rank follows from the fact that the operator [V,,, Vy] obeys 


the Leibniz rule. For example, 
[Vu Vi) AS = Fe iw) _ Suv Ahe (A.34) 


Problem A.10. Prove the Leibnitz rule for commutatior |V q, Vv]. 
Let us list some important properties of the Riemann tensor. 
(1) Tensor 

Ruvrp = Guo Rizo 


is antisymmetric in the first and second pair of indices. 
(2) Tensor Ry», is symmetric under permutations of pairs of indices (uv) > (Ap). 
(3) For any three indices, the sum of three components of the tensor Rpvap with 
cyclic permutation of these indices is zero. For instance, 


Roma + Rory + Roan = 0. (A.35) 
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(4) The Bianchi identity holds: 
VpRouw + Vo Repu + VuRoup = 0. (A.36) 


opu 
Problem A.11. Using the explicit expression (A.31) for the Riemann tensor, prove 
the properties (1) and (2). 


Problem A.12. Using properties (1), (2), (3) determine the number of independent 
components of the Riemann tensor at each point in a space of dimension D = 2,3,4. 


Proving the properties (3) and (4) by using the explicit formula (A.31) would be too 
cumbersome. Instead, it is convenient to make use of the definition (A.30). Namely, 
let us write the following equality (Jacobi identity), valid for arbitrary operators, 


[A, [B, Cl] + [C, [A, B]] + [B, [C, A] = 0. 


Problem A.13. Prove the Jacobi identity. 


Let us choose the operators A, B,C as covariant derivatives. Let us then act by the 
Jacobi identity on arbitrary scalar ¢ first, 


[Vp [Vu, ViI|¢ ag [Vu [V., Vello + [Vi, [Vp Vallo =0. (A.37) 
Let us write for the first term 


[Vp [Vn Val] = VolVus Vile = [Va VulV pd = -[V;, Vo] Vp% = -QRZ 


ppv? 


and similarly for the other terms. Here we first used the fact that [V,,Vv]¢@ = 
OpOve — Tà Ox¢ is symmetric in u, v, and then utilized (A.33). In this way we 
obtain from the identity (A.37) that 


(RZ Re + RE 


puv pup vop) 


Ono = 0, 


which implies (A.35) by virtue of the arbitrariness of 0,¢. 
Let us now act by the Jacobi identity on arbitrary vector A* and get 


[Vo, [Vs V,]]A^ Ez [Vn [Vv, VJ] A> FE [Vo [Vp, V„]]4^ = 0. (A.38) 
Then, using the definition (A.30), we have 
[Vos [Vu V] A = Vo(Ro yA?) — [Vu Vo] (Vp A’). (A.39) 
Evaluating the right-hand side of Eq. (A.39) and using the Leibniz rule, we find 
[Vp, [Vu ViI|A* = Vp Rèu A" oie ae _ |S Vp AT + R u Vo A` 


= V R} p AT + RË Ve (A.40) 


‘Oo [LV puv 


Upon substituting the expression (A.40) and similar expressions for the other two 
double commutators to the Jacobi identity (A.38) we obtain, taking into account 
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the property (3) of the Riemann tensor, 
(VF T VLR} T Viet = 0. 


opp 


Consequently, the Bianchi identity is indeed satisfied. 

By contracting indices of the Riemann tensor Ry), one can construct a new 
tensor with smaller number of indices, still characterizing the curvature of space. The 
above symmetry properties of the Riemann tensor guarantee that the contraction 
of any two of its indices results in either zero or the following symmetric tensor of 
the second rank, 


Ruy = Bis 
called the Ricci tensor. In many places of this book we need the explicit form of 
this tensor, 


Rip = Ol — Oh ol ly Pr (A.41) 


By contracting the indices of the Ricci tensor we obtain the scalar curvature, 
R= g Rw. 


Problem A.14. Find the metric, Christoffel symbols, Riemann and Ricci tensors 
and scalar curvature for 2-dimensional sphere 9°. 


Problem A.15. Show that for an arbitrary 2-dimensional surface, the quantity 
VIR is total derivative and hence the integral of the scalar curvature over the invari- 
ant volume, 


1 2 
= J a/gR, (A.42) 


does not depend on the choice of metric on the surface (Gauss—Bonnet theorem). 
Thus, in two dimensions this integral is a characteristics of topology. Note that the 
scalar curvature is twice the Gaussian curvature in two dimensions. Integral (A.42) 
is in one-to-one correspondence with the degree of the Gauss mapping and coincides 
with the Euler characteristic of 2-dimensional surface. Find the value of this integral 
for sphere and torus. 


A.4. Gravitational Field Equations 


We now have at our disposal all ingredients needed for constructing General Rela- 
tivity. In GR, the metric tensor is dynamical field (“gravitational field”), and the 
equations of GR arise as extremum conditions for the action functional. As we have 
already mentioned, one of the basic principles of GR is that all local coordinate 
frames are equivalent. This means that the equations for the gravitational field g„v 
should be written in terms of covariant quantities and their form should not depend 
on the choice of coordinates. To fulfill this requirement, the gravitational field action 
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Sgr must be a scalar, i.e., it must be written as an integral of a scalar Lagrangian 
Lgr over the invariant 4-volume, 


Sy = | Hayato 


The simplest possibility is to take the Lagrangian equal to a constant (—A) inde- 
pendent of metric, 


Sa = —A J d ryg. (A.43) 


This term can indeed enter the action for the gravitational field and play an impor- 
tant role in cosmology. Since the action is dimensionless, the parameter A has 
dimension (mass)*. This parameter is called the cosmological constant or, for rea- 
sons that are explained in Chapter 3, vacuum energy density. However, the action 
(A.43) cannot be the complete gravitational field action. Indeed, S, does not con- 
tain derivatives of the metric g,,, and, consequently, upon its variation one would 
obtain purely algebraic equation that would not enable one to interpret guv as truly 
dynamical field. 

Another scalar quantity at our disposal is the scalar curvature R, or, in general, 
an arbitrary function f(R). To understand which choice of the function f(R) as the 
Lagrangian is most natural, we recall that the commonly used field equations are of 
the first or second order in derivatives. For the field equations to be second order at 
most, it is usually required that the Lagrangian does not contain second or higher 
derivatives. Indeed, consider field theory with the action of the form 


S= I d‘xL($,0,0°¢,...). (A.44) 


Here, the symbol ¢ denotes all fields of theory, and we omitted possible tensor 
indices. The variation of (A.44) under small variation of the fields, 6 > ¢ + d¢, 
reads 

OL OL OL 

ôS = pes (Zso —— 056 + ——— 3’ sp +- J; 

ðh d(0¢) (09) 
Assuming, as usual, that the variation of the fields 6¢ vanishes at infinity, and 
integrating by parts, we arrive at the field equation of the form 


OL OL z OL 

De 1G) ald op * 0, (A.45) 
which, generally speaking, contains the field derivatives of higher than second 
order. 

The Riemann tensor (A.31), and, consequently, the scalar curvature R, contain 
the first derivatives of the Christoffel symbols eae The latter, in turn, contain 
the first derivatives of the metric tensor gv. Therefore, if the Lagrangian density 
Lgr depends non-trivially on the scalar curvature, the action necessarily involves 
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the second derivatives. By the above argument, one could conclude that it would 
be impossible to write covariant action for the gravitational field, which would 
lead to second order equations. Note, however, that if the Lagrangian density £ in 
Eq. (A.45) depends on second derivatives only via the terms of the form 


f(o) 06 (A.46) 


and does not contain higher derivatives, then the equations of motion do not involve 
higher derivatives. In fact, one can integrate the term (A.46) in the Lagrangian 
by parts and come to the Lagrangian which contains first derivatives only. It is 
straightforward to check that the action 


1 4 


depends on the second derivatives exactly in this way. 


Problem A.16. Using integration by parts, find the action equivalent to the action 
(A.47), which does not contain second derivatives. Is the Lagrangian in this action 
a scalar? Is the action itself a scalar? 


This action is called the Einstein—Hilbert action. As we will see later, the constant 
G here, which has the dimension M~?, is Newton’s gravity constant. 

The full gravitational field action in GR is the sum of the terms (A.43) and 
(A.47), 


Sor = Sa + Sex. (A.48) 


In order to obtain the gravitational field equations one has to calculate the variation 
of the action Sgr under variation of the metric 


Juv > Gp + O9uv- 


Let us start with the first, simpler term S4. To vary S,, we make use of the following 
formula of linear algebra, 


det(M + 6M) = det (M) [1+ Tr(M7'6M) + 0(5M)) , (A.49) 
where M is an arbitrary non-degenerate matrix. 
Problem A.17. Prove the formula (A.49). 
Applying the relation (A.49) to the determinant of the metric tensor, we obtain 


69 = gg” guv- (A.50) 
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Using this result, we arrive at the following expression for the variation of SA, 


6S, = -A f a'e5(V=9 = -$ | atey=G0"” by (A.51) 


Let us now calculate the variation of the Einstein—Hilbert action Sgp. It can be 
written as a sum of the following three terms, 


OS EH = 05, +652 + 693, 


where 


1 
E -g | ERINO 
T 
= 1 4 pv 
ôS = — wale Ty —gRuvðg , 
and 


683 = d'x/—ggh” ORyv- (A.52) 


1 
167G 


Taking into account the relation (A.51), we immediately obtain the explicit expres- 
sion for 65}: 


551, = -ag | dev=GR" gu (A.53) 
In order to compute S2, we note that varying Eq. (A.8) we get 
Jorg? = — gh? gpa- 
Contracting both sides of this equality with matrix g*” we obtain 
Sgt” = —g P 8gp g. (A.54) 


Consequently, 


= 1 4 HvV 
S2 = wale Ty gR O9uv- (A.55) 


It remains to find the variation 6$3, which at first glance looks the most complicated. 
To compute 653, we note that the transformation rules of the Christoffel symbols 
(A.16) imply that the variation dT’, is a tensor. Then, using (A.31), we arrive at 
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the following expression for the variation of the Riemann tensor: 
ORY, = ð rh, — 3p TE + usd bie + Fol =O el 3 = Ee el a 
By direct calculation one can check that 


ORY, = VATE) — VT), (A.56) 


where the covariant derivatives are taken with the unperturbed metric. From the 
equality (A.56) we get the following expression for the variation of the Ricci tensor 


bRyy = Va (TA) — Vi (Ta) (A.57) 


Substituting Eq. (A.57) into Eq. (A.52), we obtain 


1 V 
653 = -a | iv" [Va (Tà) — Vi(T Ay) ] 


1 
= | &a/— HUST gh 6p .), A. 
ag | teva alot PA, — Po), (A.58) 


where we put the tensor g”” inside the covariant derivatives in the second equality 
and renamed the summation indices v and À in the second term. Finally, making 
use of the property (A.26), we rewrite 0.$3 as an integral of the total derivative, 


1 
6 eee dł pv Tà — „HÀ Te). 
Sy = -gg | PPIM DA, ~ ITF) 


Consequently, S3 does not contribute to the field equations. Assembling together 
the variations (A.53) and (A.55), we derive the variation of the Einstein—Hilbert 
action, 


= 1 4 = HV 1 pv 
ôSEH = TG fa ryg (r 59 R) Oguy (A.59) 


This formula and Eq. (A.51) give the following gravitational field equations (Einstein 
equations in vacuo): 


1 
RY zI” R = 8GA g”. (A.60) 


We see that the Einstein equations are indeed second order in derivatives. The 
Einstein equations are often written in the form 


Guy = 8TG guv, 
where 
1 
Giv = Rop = zImR 


is the Einstein tensor. 
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A.5. Conformally Related Metrics 


It is useful for some applications to have relations between the Ricci tensors and 
scalar curvatures of metrics, conformally related to each other. Suppose that there 
are two metrics guv and g,,, such that 


Guv (2) = e g(x), (A.61) 


where (x) is some scalar function of coordinates. Our purpose is to express 
Rip and R — the Ricci tensor and scalar curvature constructed from the met- 
ric Gy» — in terms of R,, and R obtained from the metric gav. To this end, we 
first find the relationship between the Christoffel symbols. The direct substitution 
of (A.61) in (A.23) yields 


Dy = Tia + OLE + HO — gung’ Opp. 


As a result of substituting this expression in Eq. (A.41) and straightforward (but 
tedious) calculation we obtain 


Rw = Rup — 2V Vve = Jurg PVAV pL + 20,PO,~ a 2g uvg On Poy, (A.62) 


where the covariant derivatives are evaluated with the metric gv. Hence, for the 
scalar curvature R = ĝ”” Ry, we have 


Ê = e7 (R — 6g” V Vvo — 6g” 0,00), (A.63) 


while for the Einstein tensor we get 


Cr hes ôm Ê 


= Guu — 2V Vve + 20,PO,9 + gur (2VaV `p + app), (A64) 


where indices in the right-hand side are raised and lowered by the metric guv- 
Finally, for the integral entering the gravitational field action, the relationship has 
the form 


J Řy-—ôĝd*z = J e°? RyZgd*x + 6 / e”? gh Op py/— Ga x. 


The latter relation is obtained with the use of (A.63) and integration by parts. 
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As an example of application of these formulas, we consider “non-linear gravi- 
tational theories” with actions of the form 


S= - | dav =5f), (A.65) 


where f(R) is an arbitrary function of the scalar curvature R. We are going to 
prove that in vacuo, these theories are dynamically equivalent to conventional grav- 
ity (General Relativity described by the Einstein-Hilbert Lagrangian) plus self- 
interacting scalar field.” 

We first find the field equations obtained by varying the action (A.65). We write 
the variation again as the sum of three terms, 


5S = 08; + 854 465s, 

55, =~ | ate FR) AVO), 

S2 =~ | dtey=Gf" R) Rud, 
884 = = | abey=Ggl "(BPR 


where f’(R) = Of(R)/OR. The variations 65; and Sə are simple generalizations of 
the analogous expressions for the Einstein—Hilbert action (see (A.53) and (A.55)), 


8) =- | dtey=GF (Rg gy (A.66) 
isi J d's y gf (R) R” Sgu. (A.67) 


To calculate 8S3 we use (A.57), where we substitute the variation of the Christoffel 
symbols 


1 
a ~ 59? (VndGup + Vi69up — Vp 9uv): 
As a result we obtain from (A.57) that 


1 
Ruw = zC VaV Ou + V`V pôgav + V*Vi Sonn — Vi V pôg3). (A.68) 


Then the variation 653 reads 
AE / d= 95g u(VEV” — gt” VxV*) F(R), (A.69) 


where we integrated by parts twice. In the case of the Einstein—Hilbert Lagrangian 
f’ =1, so the expression (A.69) vanishes identically. 


For the sake of convenience, in Eq. (A.65) and to the end of this Section we adopt the system of 
units where 167G = 1. 
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Finally, equating to zero the variation 6S we obtain the field equations for the 
theory with the action (A.65): 


SS (Ru — f'(R)Rwv + (Va Vv — gw V`Va) f (R) = 0. (A.70) 


Note that these equations contain fourth order derivatives. 
It is convenient to introduce the new variables g,, by making a conformal 
transformation: 


Guv = P iu, Y = f'(R). (A.71) 


We will assume that % > 0, then the new metric Juy has the same signature as the 
metric guv. The relationship between the Ricci tensor are scalar curvature of these 


two metrics is given by Eqs. (A.62), (A.63) with p = —4 In y. We have, therefore, 
Rw = Ry tv ÕI + spina Y 
-z PV nb + 2gp VAY V>), (A.72) 
R=R+4+3V,V4y — Su, (A.73) 


where symbols with tilde are calculated, and indices are raised and lowered with 
the metric Juv. 
Let Ro(w) be a solution to the equation 


fF [Ro(~)] -4 =0 


(the analysis below can be extended to the case of multiple solutions), i.e., Ro is 
the inverse function to f’. It follows from the second of Eq. (A.71) that 


R= Ro(w). (A.74) 


In terms of new variables, Eq. (A.70) takes the form 


= 0? S FIAR) = VRO + SV 0 = Faw TaT) 
(A.75) 


We still have to account for Eq. (A.74). Its left-hand side is given by (A.73) where 
R is obtained by contracting (A.75) with g“”. In this way we get the equation 


DÜ ČA = Vv + 5 {URo(W) —2F[Ro(w))}=0. —— (A.76) 


So, instead of fourth order equation (A.70), we obtain an extended system of second 
order equations (A.75), (A.76) in terms of the new variables g,,, and w . This system 
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coincides with the equations of conventional gravity interacting with the scalar field 
w. The action whose variation with respect to Jay and wW gives Eqs. (A.75) and 
(A.76) is 


s=- | a'sy-5Ř 


a EI 33 VuVu Roh) FIR) 
+fa al J + a i (A.77) 


Problem A.18. Obtain Eqs. (A.75) and (A.76) by varying the action (A.77). 


The kinetic term in the action (A.77) can be cast into the canonical form by the 


replacement wv = eV 34, Finally, we arrive at the action 


S= fiva {-R +9” V Vro 
+e- VĒ Ro (ev) — e234 f [Ro (ev39)] } 


It describes self-interacting scalar field ¢ in the framework of GR. As we have seen, 
this theory is dynamically equivalent to the theory of “non-linear” gravity with the 
action (A.65). 


Problem A.19. Consider scalar-tensor theory of gravity with the action 
1 V 
S= | day=5( -R+ Zulo aede -Vio ); 


where p is scalar field. Under what conditions on functions w(p) and V(ẹ) this 
theory is equivalent to f(R)-gravity? 


Note that as a result of the conformal transformation, matter fields begin to 
interact with the dilaton field ¢. This results in various physical effects. In particular, 
for homogeneous solution ¢ = ¢(t) the “cosmic time” (time in the FLRW metric) 
differs from the “atomic time” (time determining the evolution and interaction of 
matter fields). In this sense, in the presence of matter fields f()-gravity is not 
equivalent to GR with scalar field. 


A.6. Interaction of Matter with Gravitational 
Field: Energy-Momentum Tensor 


Equations (A.60) describe the dynamics of the gravitational field in vacuo. However, 
it is of primary interest to study gravity in the presence of matter fields which serve 
as sources of gravitational field. To study this general situation, we add to the action 
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(A.48) a new term, 


a= / d tf =OL ins (A.78) 


that describes matter and its coupling to gravity. Here the Lagrangian density Lm 
is a scalar function of the gravitational field g,, and matter fields. We collectively 
denote the latter by W: 


Lm = Lm(Y, Juv). 
Once we add the term (A.78), the Einstein equation (A.60) get modified as follows, 
1 
Ruy — 5 IR = 89G(Aguy + Tav). (A.79) 


Here we turned to tensors with lower indices. The symmetric tensor Ty is defined 
by the following relation, 


1 
Sm = a J dtz =9T puð: (A.80) 
The latter relation can be rewritten with account of (A.54), 


1 
Sm = 5 J dry gT” Suv. 


Equation (A.79) (with upper indices) is obtained then by using (A.59). 

To understand the physical meaning of the tensor Tv, we calculate it for two 
simple theories: scalar field theory and theory of electromagnetic field. The covariant 
action describing the real scalar field interacting with gravity reads 


ius J R E a J PNT CA z vo), (A.81) 


where the scalar potential V (¢) can be an arbitrary function of the field ¢. Generally 
speaking, one can add to the action (A.81) yet another term which vanishes in flat 
space, 


Se =€ | d'ey=GRU(O), (A.82) 


where U(@) is an arbitrary function. We restrict ourselves to the choice € = 0. In 
this case, the interaction of the scalar field with gravity is called minimal. 

Using the definition (A.80), we obtain the following expression for the tensor 
Tyv of the scalar field, 


Ta = On O09 — Juv Lsc- (A.83) 
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Problem A.20. Find the tensor Tay for free massless scalar field non-minimally 
coupled to gravity, € #0, by choosing 


and V(¢) = 0. At what value of the parameter € does the trace g"”T,,, vanish on 
the equations of motion? For what V(@) does this property remain valid? 


We now find the explicit form of the tensor Tuy for the electromagnetic field. 
The action for the vector field A, coupled to gravity has the form 


1 
Serm = -I [ dey gF wy Frpg'*g’?, (A.84) 
where F» is the conventional field strength tensor, 
Fp = OpAp — Op Ap. (A.85) 


At first glance, the ordinary derivatives in the definition (A.85) should be replaced 
by covariant derivatives in curved space. But it is straightforward to check that for 
symmetric connection, the terms with the connection cancel out due to antisym- 
metrization in 4 and y, so that V, Ap — Vy Ap = Oy Av — r Ap, and Fur is a tensor. 
Using the action (A.84), we obtain the following expression for the tensor T,,, of 
electromagnetic field, 


1 
Tio — -Faa Fpp Sy Iu Pap F. (A.86) 


Now we note that in Minkowski space, the tensor Tuy coincides with the energy- 
momentum tensor for both scalar and electromagnetic field. For scalar field, Ti; is 
exactly equal to the Noether energy-momentum tensor, while for the electromag- 
netic field, 7% differs on the equations of motion from the Noether tensor by the 
divergence of an antisymmetric tensor. 


Problem A.21. Prove these statements. 


In particular, the (00)-components of these tensors in Minkowski space are 


Tig = 5000)? + 5(00)? + V(6) 


and 
1 1 1 
These are indeed the energy densities of scalar and electromagnetic fields. 
Generally, the tensor T,,, defined by Eq. (A.80) is called metric energy- 
momentum tensor. We emphasize that it is always symmetric. Below we prove that 


TS" = (E? +H). 


in Minkowski space and on equations of motion, it is always equal to the Noether 
energy-momentum tensor modulo the divergence of an antisymmetric tensor. 
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The energy-momentum tensor is conserved in flat space, 
Opt” = 0. (A.87) 


This leads to the conservation of energy and momentum. It is natural to assume that 
the generalization of the conservation law (A.87) to curved space is the covariant 
conservation equation 


V aT” = 0. (A.88) 
To see that this is indeed the case, we take the divergence of both sides of the 
Einstein equations (A.79), 


1 
ve (Fy. z TA = 81GV"T yy. (A.89) 


Let us prove that the left-hand side of Eq. (A.89) is identically zero. To this end, 
we first contract the indices À and p in the Bianchi identity (A.36). As a result, we 
obtain the following identity, 


VpRov — Vv Rop + Vito, =D: 


Then we contract it with g’? and arrive at 
1 
0= V pie =N phe V` Ryw = 2V" (Bu = 5a) . 
Thus we obtained the identity 


1 
ve (Fy. — saw) = 0, 


which implies that the covariant conservation law of energy-momentum tensor 
(A.88) is a necessary condition for the consistency of the Einstein equations. 

On the other hand, the energy-momentum tensor is entirely determined by the 
action of matter fields. Therefore, to check the consistency of the whole system 
of field equations, one needs to prove that the conservation law (A.88) is a conse- 
quence of matter field equations. The proof makes use of the invariance of the action 
under coordinate transformations. We begin with evaluating the first variation of 
the metric g”” under small coordinate transformation 


gP = oh 464, (A.90) 
Substituting Eq. (A.90) to the general formula (A.7), we get 
g(a!) = (5K + OrxE")(5p + 3E”) (2) 

= gh (x) + g*O,E" + g>”, (A.91) 
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where we neglected terms of the second order in é”. Expanding the left-hand side 
of Eq. (A.91) we write 


g/t (ar") = g™” (x) + Ang” (a)E* + OE) = g” (x) + Oagh” (x)E* + OE). 


In this way we obtain the following relationship between the functions g”” (x) and 
g'”” (x) taken at points with the same coordinates in the old and new coordinate 
frames, 


g” (x) = gh” (x) — Ong” (x)E + g ^I + ghOnEY. (A.92) 


Problem A.22. Check by explicit calculation that the relation (A.92) can be written 
in the following covariant form: 


g/t” = gh + HEY 4 En. (A.93) 


Now, the invariance of the matter action under coordinate transformations means 
that the variation of this action is equal to zero when simultaneously metric g”” (x) 
is transformed as in (A.93) and matter fields are transformed accordingly, w'(x) = 
W(x) + pels). The form of dwWe is dictated by the transformation properties of 
matter fields under the coordinate transformation (A.90). For instance, for scalar 
field ġ 


dpe = —E" OO. 


So, in the general case we have 
1 
b¢Sm = 5 | PaT" + VE) + fate V/- ot be = =0. (A.94) 


The identity (A.94) is valid regardless of the field equations. Now we assume in 
addition that the matter field equations are satisfied. Then the second term on the 
left-hand side of Eq. (A.94) vanishes. Therefore, we proved that the matter field 
equations guarantee the validity of equality 


J diry ZgT (VHE + VEN) =0. 


Since the vector €" is arbitrary and T),, is symmetric, we arrive, upon integration 
by parts, at the covariant conservation law (A.88), as required. 

We now use the identity (A.94) to prove that in flat space, the metric energy- 
momentum tensor T, coincides on the equations of motion with the Noether tensor 
Tuy» modulo total derivative. In flat space, the identity (A.94) takes the form 


/ d* aT "E + i phen Bb ove = 0, (A.95) 


where we again used the symmetry of the tensor Ty. In Minkowski space, the action 
is invariant under the variations of matter fields dW¢ corresponding to shifts (A.90) 


A.6. Interaction of Matter with Gravitational Field: Energy-Momentum Tensor 469 


with constant é”. Consequently, the second term in (A.95) can be written as 


J ao Ove = — J dar yO"e", (A.96) 


where 7,,, coincides on the equations of motion with the conserved Noether energy- 
momentum tensor. 


Problem A.23. Modifying the proof of the Noether theorem, show the validity of 
the relation (A.96), where Ty, is equal on the equations of motion to the Noether 
energy-momentum tensor. 


Integrating by parts Eq. (A.95), we see that 
OY (Tay — Tuy) = 0. 


This is the identity valid irrespective of the equations of motion. The identity can 
only hold if the difference (Tuv — Tyv) is the divergence of an antisymmetric tensor, 


Tiu = Tap = Aur, where Ava = —A)vy; (A.97) 


as promised. 
The total derivative of the form (A.97) does not contribute to the total energy— 
momentum 4-vector 


P= parm. 


Indeed, because of the antisymmetry property of A va, its contribution is of the 
form 


I r3 A”. 


This integral is zero for fields vanishing at spatial infinity. Hence, metric and Noether 
energy-momentum tensors are equivalent in flat space in the sense that they give 
equal values of energy and momentum. 


Problem A.24. Consider tensor of the form 
Ow = Gin = ô 3v) f, 


where f is an arbitrary function. Clearly, this tensor is identically conserved in flat 
space. Express it in the form Ov = ^Am, where Apuva = —Aup- 


Problem A.25. Check explicitly that the metric energy-momentum tensor of scalar 
field coupled non-minimally to gravity, see (A.81) and (A.82), differs in flat space 
from the Noether energy-momentum tensor by the total derivative of the type (A.97) 
for arbitrary U (¢ġ) and £. 
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To conclude our discussion of energy-momentum tensor in GR, we make the follow- 
ing observation. In flat space, the differential conservation law (A.87) implies the 
existence of four conserved quantities, the components of the energy-momentum 
4-vector. However, Eq. (A.88), in general, does not imply the existence of four 
integrals of motion in curved space, which could be interpreted as the energy and 
momentum of the system. In this regard, the concepts of energy and momentum, 
generally speaking, are not defined in GR. For spatially localized gravitating sys- 
tems one can define the energy and momentum by making use of the asymptotics of 
the gravitational field far away from the system, but in general such a construction 
is impossible. In particular, talking about the total mass of the Universe does not 
make sense. 


A.7. Particle Motion in Gravitational Field 


As a digression from the discussion of the properties of the Einstein equations, let 
us study the motion of point particles in external gravitational field. The action for 
point particle in GR has the same form as in special relativity, 


Sp = -m f as (A.98) 


In GR, the interval along the world line of a particle involves the space-time metric, 


dat dx” 
ds = y dz" dz" guy = S g suv 


where we introduced the affine parameter r along the world line, and guv = 
Juv [x(7)]. In terms of the affine parameter, the action (A.98) is 


Sp= -m f \/ hE" gur(x)dr, (A.99) 


where dot denotes differentiation with respect to 7. Equation of motion obtained 
by varying the action (A.99) is 


d Gupt” 1 tòr” Jur 
— -— = 0. A.100 
T (=) +3 VBE Gm) 


Problem A.26. Derive Eq. (A.100). 


The affine parameter T can be chosen in such a way that the 4-velocity vector 


da 
pi ee 
z dtr 


(A.101) 


has unit length at each point of the world line, 


Juvuřu” =1. (A.102) 
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This choice identifies the affine parameter with the proper time of the particle, since 
Eq. (A.102) is equivalent to 
ds = dr. 


With this choice of the world line parameterization, equation of motion (A.100) 
becomes 


d 1 
-gg Gut”) + SOngrrwu” = 0. (A.103) 


Contracting this equation with g”°, we obtain 


du? z l 
oY _ gre (2% - Sông) u” =0. (A.104) 


Since guv = guv[æ(s)], the definition of the 4-velocity gives 


duv 


à 
da = O) JuvU 4 


Substituting this expression into Eq. (A.104) and using the formula (A.23) for the 
Christoffel symbols, we finally arrive at the following form of the equation of motion, 
du” 


Fa ou = 0. (A.105) 


Multiplying this equation by ds and noticing that 
dx* = uòds 
along the particle world line, we rewrite Eq. (A.105) as 
du” + T¥,utda* =0. 


We recall now the parallel transport law of contravariant vectors (A.13) and observe 
that the geometric meaning of Eq. (A.105) is that under parallel transport along 
the world line the normalized tangential vectors u[x(r)] transform into each other. 
Curves which satisfy this property are geodesics (shortest paths), and Eq. (A.105) 
is the geodesic equation. 

The action (A.98) does not make sense for massless particles, m = 0. To find 
the world lines of these particles (e.g., rays of light) one directly uses the geodesic 
equation 

du” T A Bem 

T +Ty,u"u* = 0, (A.106) 
where 7 is now an arbitrary parameter along the world line, and 4-velocity u” is 
still defined by Eq. (A.101). In the massless case, the geodesic must be light-like 
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(null), 
ds? = 0, 
or, in differential form, 
Juv" t” = guyuu” = 0. (A.107) 
Problem A.27. Show that Eq. (A.106) is consistent with the requirement (A.107). 


Problem A.28. Check that the equation of motion of massive point particle as well 
as the equation for light-like geodesic can be obtained from the action 


1 
= = y dr n [P i i" gu (£) +m? , (A.108) 


where n(T) is a new auxiliary dynamical variable (“Dynamical” here means that one 
of the equations of motion is obtained by varying the action with respect to n.) This 
new variable transforms under the change of parameterization as 


rro = ZE] 


Note that nņ?(T) can be considered as internal metric on the world line, while the 
action (A.108) can be viewed as the action of four fields x” (r) in 1-dimensional 
space with dynamical metric. 


Equation (A.106) can be obtained by considering wave equation in curved space-time. Let 
us discuss photons for definiteness. The Maxwell equation that follows from action (A.84) 
reads 


VFM” =0. 


If the wavelength and wave period are small compared to spatial and temporal scales 
characteristic of metric, one neglects terms involving the Ricci tensor. One chooses the 
gauge V, A” = 0 and writes the field equation as follows: 


VVL A” =0. (A.109) 
Solutions to this equation have the form 
A! = ate’? 


where a” (x) is a slowly varying amplitude, and S(x) is a large phase called eikonal (see, e.g. 
the book [267]). Locally (e.g., near x” = 0) this phase can be written as S = const + P x“. 
This is a standard expression for plane wave, and Po and P; are photon frequency and 
wave vector (momentum). Thus, the local 4-momentum is 


P, = 8,8. 


If metric varies slowly, then P, varies slowly too, even though P, itself is large. 
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The latter observation shows that the leading term in Eq. (A.109) involves square of 
the derivative of S. Therefore, to the leading order this equation is reduced to 


P,P" =0. (A.110) 
Since P, is gradient of S, it obeys 
Vi, — Ve PL = 0. 


One multiplies the latter identity by P” and makes use of (A.110) to get P’V,P, = 0. In 
this way one arrives at equation 


PHT P” = P8 PET PRP’ = 0. (A.111) 


There is a set of world lines x”(r) associated with the vector field P” (x): vectors P” 
are tangent vectors to these lines. These are precisely photon world lines. The parameter 
7 along a worlds line can always be chosen in such a way that P“ = dx"/dr. Since 
P", P” (x) = (d/dr)P” (x(r)), Eq. (A.111) can be written as 

U Par =o, 

dt 
This is precisely the geodesic equation (A.106). As a byproduct, we have shown that 4- 
velocity u” = dx" /dr is nothing but the photon 4-momentum. 

To see that the parameter 7 is not arbitrary, and is rather determined by the geodesic 

equation, let us parameterize the geodesic line by time x° instead of r. Then 


dx’ dP — dP" o 


i p0 
eed dx?’ dr d? ’ 


and the temporal component of the geodesic equation (A.106) becomes 


dP® 


ago tT Pn, (A-112) 


where v” = (1,dzt/dz?). The spatial components of the geodesic equation become the 


system of equations for dx’ /dx?. Once its solution a’(x°) is known, one solves Eq. (A.112) 
and finds P“(x°). Then the parameter 7 can be found, e.g., by solving dx°/dr = P? (x°). 


A.8. Newtonian Limit in General Relativity 


Let us now discuss how the main object of the Newtonian gravity, the gravitational 
potential, arises in General Relativity and how the Newtonian gravity law follows 
from GR. To this end, we study the motion of a particle in weak static gravitational 
field, i.e., in space with metric 


Suv = Nv + Ay (X), (A.113) 
where nuv is the Minkowski metric, and all components of h,,,(x) are small, 


hyv(X) <1. (A.114) 
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Furthermore, we consider non-relativistic particles, 


i drt Zi 
y= : 
dt 


Let us write the explicit form of various components of the geodesic equation (A.105) 
to the linear order in the velocity v’ and gravitational field h,,,. We first note that 
in the linear order, the proper time of a particle ds is related to the coordinate time 
dt by 


ds = (1 + te) dt. (A.115) 


Problem A.29. Find in the general case the relation between coordinate time and 
proper time of a particle moving with the coordinate 3-velocity v' = d, Show that 
in the linear order this relation indeed reduces to (A.115). 


Hence, the 4-velocity components u” are related to the metric and physical velocity 
vt as follows, 


dt h 

— 00 

= — 1 - — 

ds 2° 

;_ det i 

t= RU. 
ds 


It is straightforward to check that in the linear order, the zeroth component of 
the geodesic equation is satisfied identically. Indeed, the first term at vanishes, in 
view of (A.115), because the metric is static. The second term T“, u”u^ 
small factor from the very beginning, since all Christoffel symbols vanish for the 
unperturbed metric n,n. Furthermore, the component T94 is equal to zero for static 
metric, hence the second term must involve the velocity u’ at least once. So, all 
terms in the zeroth component of the geodesic equation are at least of the second 
order. 
Spatial components of the geodesic equation in the linear approximation read 
du’ 


where we again took into account that the second term is inherently small due to 
the presence of the Christoffel symbols, so that the contributions involving velocities 
ut drop out. Recalling the explicit expression (A.23) for the Christoffel symbols, we 
arrive at the following equation describing the motion of non-relativistic particles 
in weak static gravitational field, 


contains 


dv’ 
dt 


where we introduced new function ®(x) defined by 


= —ð;ð, (A.116) 


goo = 1+ 290. 
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Equation (A.116) coincides with the equation of non-relativistic mechanics describ- 
ing the motion of a particle in an external potential ®(x), so the field ®(x) is 
identified, in weak field limit, with the Newtonian gravitational potential. Note 
that, as follows from the above analysis, the motion of non-relativistic particles in 
weak static fields depends only on 00-component of metric. 

To cross-check the interpretation of the field ®(x) as the Newtonian potential, 
let us see that the Poisson equation 


A® = 41Gp (A.117) 


indeed follows from the Einstein equations in the Newtonian limit. Here A = (0;)? 


is the Laplace operator. At the same time we will identify the constant G, entering 
the Einstein—Hilbert action, with Newton’s gravity constant. 

To this end, let us find, starting from the Einstein equations (A.79), the metric 
produced by a static distribution of non-relativistic matter with energy density 
p(x). Before doing that, it is convenient to rewrite the Einstein equations in the 
equivalent form. Taking the trace of both sides of the Einstein equations, we obtain 


R=-81rG(4A+T), (A.118) 
where 
PEG ly 


is the trace of energy-momentum tensor. Substituting (A.118) back into the Einstein 
equations, we arrive at the following equivalent equations, 


1 
Ryy = 81G le — 5 IT — tw). (A.119) 


This form of the Einstein equations is sometimes more convenient in practical cal- 
culations than the original one, because, as a rule, the curvature tensor R,, has 
more cumbersome structure than the energy-momentum tensor Tyv. 

Coming back to the problem of calculating the gravitational field, we assume that 
the cosmological constant is absent, i.e., A = 0, and that both energy density p(x) 
and all its spatial derivatives are small. The gravitational field produced by such a 
body is weak, i.e., the metric has the form (A.113). The only nonzero component 
of the energy-momentum tensor for static distribution of non-relativistic matter is 


Consider now the 00-component of Eq. (A.119). For weak gravitational field, we 
neglect the quadratic terms in the expression (A.41) for the Ricci tensor. Further- 
more, the second term in the expression for Roo vanishes for static metric. Conse- 
quently, the left-hand side of the 00-component of Eq. (A.119) takes the form 


1 
Roo = OTA, = 34900; (A.121) 
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where the latter equality is again obtained for weak and static field. Substituting 
this expression and the explicit form (A.120) of the energy-momentum tensor in 
Eq. (A.119) we obtain (at A = 0) Eq. (A.117). Hence, © is indeed the gravitational 
potential and G is Newton’s gravity constant. 


A.9. Linearized Einstein Equations about Minkowski 
Background 


Let us generalize Eq. (A.117) to the case of weak but otherwise arbitrary gravita- 
tional field about Minkowski background. In this case the metric has the form (cf. 
(A.113)) 


Gunl®) = Nv + Apu (2), 


where |hj,v(x)| < 1, and perturbations huy (x£) can depend on both time and spatial 
coordinates. We make use of the Einstein equations in the form (A.119), and set 
A = 0, so that Minkowski space is the solution for T y = 0. In fact, the computation 
of the Ricci tensor to the linear order in hy has already been performed: we can use 
the formula (A.68), considering it as the expression for the deviation of the Ricci 
tensor from its zero value in Minkowski space. Thus, we make the replacement 
Ôguv — Ay» in (A.68), replace the covariant derivatives with ordinary ones and 
raise and lower indices by Minkowski metric. As a result, we obtain the linearized 
equation (A.119) in the following form, 


(—O,O* hy + 83` hav + POI Ap — OLOA) 
= 167G 2 — T) À (A.122) 


where Tuy is assumed to be small. 
Equation (A.122) is invariant under gauge transformations 


br Site + Ook (4.123) 


Tyv = Tyv, 


where (a) are small gauge parameters. The transformation (A.123) is nothing 
but the linearized transformation (A.92); since Tuy is small, it does not change, 
in the linear order, under the small coordinate transformation (A.90). It is often 
convenient to take advantage of this gauge freedom and impose the harmonic gauge 


1 
Ah — z3LhÀ = 0. 


In this gauge, the linearized Einstein equations take particularly simple form 


1 
huv = —167G 2 = T) , 
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where O = 00 is the D’Alembertian in Minkowski space. Clearly, this equation 
describes massless gravitational field sourced by the energy-momentum tensor. 


A.10. Macroscopic Energy—Momentum Tensor 


To find solutions to the Einstein equations describing the expanding Universe filled 
with matter (e.g., relativistic plasma or “dust” ), we need the appropriate expression 
for the energy-momentum tensor. It is sufficient for our purposes to treat matter 
as ideal fluid and make use of the hydrodynamic approximation to the energy- 
momentum tensor. To obtain its explicit expression in curved space-time, we first 
consider the case of flat space. Isotropic fluid without internal rotation has the 
following energy-momentum tensor in its own rest frame, 


TH = (A.124) 


oO Oo Q Ff 


0 
P 
0 
0 


3 o S oO 


0 
0 
Pp 
0 


where p and p are energy density and pressure. Let us first generalize this expression 
to moving fluid. We note that in the rest frame, the 4-velocity vector is 


Therefore, we define a tensor object by 
(p+ p)u”u” — pn”. (A.125) 


Clearly it coincides with the energy-momentum tensor (A.124) in the rest frame. 
Since both energy-momentum tensor and the object (A.125) transform according 
to the tensor law, they coincide in all reference frames. The simplest way to gen- 
eralize the expression (A.125) to curved space-time is to replace Minkowski metric 
nt’ by the general space-time metric g””. Indeed, as we discussed above, there 
exists locally Lorentz reference frame at every given point of space-time. In this 
frame, the metric tensor at that point coincides with Minkowski tensor, and the 
matter energy-momentum tensor has the form (A.125). Performing the coordinate 
transformation to arbitrary frame, we arrive at the following final expression for the 
energy-momentum tensor, 


T!” = (p + putu” — pg”. (A.126) 


It is worth noting that, generally speaking, expression (A.126) is valid only in the 
case of weak gravitational field. In strong field, there may appear additional terms 
depending on the curvature tensor. 
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In general, density p, pressure p and 4-velocity u” are arbitrary functions of time 
and spatial coordinates, with the restrictions that 


ufuy, =1 (A.127) 
and 
Vit =0. (A.128) 


Equality (A.127) is a direct consequence of the definition of 4-velocity, u” = da" /ds, 
while equality (A.128) is the covariant conservation law for the energy-momentum 
tensor. 


Problem A.30. Write various components of the conservation law (A.128) explic- 
itly in the case of flat space. Show that in the non-relativistic limit (i.e., for 
lv| < 1,p < p) the resulting equations coincide with the hydrodynamic continu- 
ity equation and the Euler equation. 


To conclude this Section, we note that in the linearized theory with A = 0, 
Eqs. (A.119) and (A.121) give the following equation for the Newtonian potential 
in the presence of static source, 


A® = 4rG(Too a Tj). (A.129) 


(Summation over i is assumed.) For the energy-momentum tensor of the form 
(A.124) we have 


A® = 4nG(p + 3p). (A.130) 


In this sense, the source of the gravitational field in GR is not energy but rather 
the combination (p+ 3p). In particular, an object made of hypothetical matter with 
p+3p < 0 would repel rather than attract non-relativistic particles. (In other words, 
it would “antigravitate”.) The homogeneous isotropic Universe filled with such a 
matter would undergo accelerated expansion; see Sec. 3.2.4. 


A.11. Notations and Conventions 


Indices u, v,... refer to space-time and take values 0, 1, 2, 3. The summation over 
repeated indices is assumed. 

Indices i, j, ... refer to space, 7,7 = 1,2,3. Spatial vectors are marked with 
bold-face font. The summation over repeated lower spatial indices is assumed, e.g., 
aibi = ab, aja; = a’. 

The signature of metric is (+,—,—,-—). 
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The Riemann tensor is defined by 


Vie Vu] A SATR 


gpu 
Its explicit expression is given in (A.31). The Ricci tensor is equal to 


Rav = a, _ ON yy + Por _ len aa 


Minkowski metric is denoted by n,,, = diag(1,—1,—1,—1). The metric with small 
perturbations about the spatially flat Friedmann—Lemaitre—Robertson—Walker solu- 
tion reads 


ds” = a? (M) (Nw at lag LP dz”, 
where x° = n is conformal time. In other words 
Juv = a? (N) (Mw + hyv). 


Indices of hy are raised and lowered by the Minkowski metric. 
Our sign convention is that 


is a negative-frequency function. 


Appendix B 


Standard Model of Particle Physics 


In this Appendix we introduce the main elements of the Standard Model of parti- 
cle physics. Of course, our presentation cannot be exhaustive; we do not consider 
numerous concrete phenomena in the world of elementary particles. Our purpose is 
to briefly describe those aspects which are used in the main text. 


B.1. Field Content and Lagrangian 


The Standard Model is in excellent agreement with all known to date experimental 
data (with the exception of neutrino oscillations, see Appendix C), obtained in low- 
energy physics, high-precision measurements and high-energy physics [1]. The basis 
of the Standard Model is quantum field theory; see, e.g., books [268-271]. 

The Standard Model describes the following particles thought to be elementary 
to date: 


(a) gauge bosons: photon, gluon, W*-bosons, Z-boson; 

(b) quarks: u, d, s, c, b and t; 

(c) leptons: electrically charged (electron e, muon u and r-lepton) and neutral 
(electron neutrino ve, muon neutrino V, and T-neutrino Vr); 

(d) neutral Higgs boson h. 


66 99 6? 


Particles of types “a” and “d” are bosons, particles of types “b” and “c” are 
fermions. Fields describing particles of type “a” are gauge fields. They are vectors 
under the Lorentz group and serve as mediators of gauge interactions. In particle 
physics, fields of types “b” and “c” are often called matter fields; we will avoid this 
terminology. They are spinors under the Lorentz group and participate in gauge and 
Yukawa interactions.! The field describing the Higgs boson is scalar; it participates 
in the Yukawa interactions and self-interactions. In addition, the Englert-Brout— 
Higgs field plays a special role: its vacuum expectation value gives masses to all 
massive Standard Model particles. 


1Non-Abelian gauge fields also carry charge of their own gauge group and participate in gauge 
interactions. 
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The Standard Model gauge group is SU(3). x SU(2)w x U(1)y. It describes 
strong interactions (color group SU (3)e with gauge coupling gs) and electroweak 
interactions (SU(2)y x U(1)y with gauge couplings g and g’, respectively). The 
electroweak gauge group is in the Englert-Brout—Higgs phase, with the electromag- 
netic group U(1)em left unbroken. Accordingly, W=- and Z-bosons are massive, and 
photon remains massless. The Standard Model fields form complete multiplets with 
respect to these gauge groups, i.e., they belong to certain representations of these 
groups. 

Gauge fields are in the adjoint representations of their groups: there are eight 
gluon fields Gi, (a = 1,...,8, according to the number of generators of SU(3).<), 
three gauge fields Vi of SU(2)w (i = 1,2,3 corresponding to the generators of 
SU(2)w) and one field B, of U(1)y. As a result of the Englert-Brout—Higgs mech- 
anism, three combinations of the fields Vi and B, are massive and describe W=- 
and Z-bosons, 


Wr = an FiV?), (B.1) 
EEEE By (B.2) 
Vern A T l 


The fourth combination, 


A= (B.3) 


1 173 
Verge! Vi +gB,), 


remains massless and describes the photon. The relation between the fields Z, and 
A,, and original gauge fields is also written in the form 


Zp = COS Oy : V3 — sin Îw - By, 


Ap = cos Oy - By + sin Oy - V3, 


where Ow is the weak mixing angle, 


/ 
tan6,, = 2 
g 


The experimental value of sin Oy, is? 
sin dy = 0.481. 


Table B.1 contains dimensions of representations of vector fields and their 
Abelian charges. Note that in this Table as well as in a number of subsequent 


?Here and in what follows, if not stated otherwise, we omit important details related to radiative 
corrections. 
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Table B.1. Dimensions of representations and charges 
of gauge and Englert—Brout—Higgs fields; symbol 0* indi- 
cates that fields B, and A, are gauge fields of groups 
U(1)y and U(l)em, respectively. 


Field\ Group SU(3)e SU(2)w U(1)y U()em 


Gu 8 1 0 0 
Vu 1 3 0 
Bu 1 1 0* 

E al ae 
Zu 1 0 
Au 1 o” 
H 1 2 1 


8 ve 
Gy, = ee 
a=1 
3 ji 
Va = I: 
i=1 


where à are the Gell-Mann matrices, and rt are the Pauli matrices (A%/2 and 7# /2 
are generators of SU (3). and SU (2)w, respectively). 
Fermion fields “b” and “c” form three generations of quarks and leptons, 


I: u, d, ve, e 
II: ¢, 8, uuh 


Ill: t, b, v,, T: 


Particles within a single generation are discriminated by gauge interactions (they 
have different gauge quantum numbers), while the three “partner” particles of dif- 
ferent generations (e.g., u-, c- and t-quarks or electron, muon and r-lepton) have 
the same gauge quantum numbers but different masses and Yukawa couplings to 
the Higgs boson. 

One way to describe fermionic fields in (3 + 1)-dimensional Minkowski space? is 
to introduce left and right two-component (Weyl) spinors Xz and Xr. These spinors 
transform independently under the proper Lorentz group.* 


3We leave without discussion the description of fermion fields in curved space-time. It is considered 
in the accompanying book. 

4More precisely, they transform according to fundamental (xz) and antifundamental (x£) repre- 
sentations of group SL(2, C), which is the universal covering group for the proper Lorentz group 
SO(3, 1). The covering is two-fold; this property is responsible, in particular, for the fact that not 
fermion fields themselves, but their bilinear combinations are physical observables. 
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Of the two-component spinors one can construct Lorentz scalars, vectors and 
tensors. In particular, one can show that bilinear combinations 


XLIO?X1; Xpio2Xn, 
are scalars, and 
Nee es LO Xr 
are vectors. Here 
o” = (1,0), co =(l,-o), 
where ø are the conventional Pauli matrices acting on the Lorentz indices. 


Problem B.1. Check the validity of the above statements. Hint: Make use of 
the equivalence of the fundamental and antifundamental representations of SU (2); 
find the transformation law for spinors X, and Xr under the Lorentz boosts and 
3-dimensional rotations. 


The full Lorentz group, besides proper transformations (boosts and rotations), 
also contains reflection of space (parity transformation) P and inversion of time T. 
Two-component spinor X; (or Xr) does not transform to itself under spatial reflec- 
tion. The relevant representation of the full Lorentz group is in terms of 4-component 
Dirac spinor ~. The Dirac spinor includes two 2-component Weyl spinors x; and xp, 


XR 
The free Dirac field obeys the Dirac equation 


iO = mob, 


where m is fermion mass and y“ are four 4 x 4 Dirac matrices obeying anticommu- 
tation relations 


WATTEN 


In the chiral (Weyl) representation, the Dirac matrices are 
ü 0 ot 
qS : 
or 0 
In this representation, the Dirac equation is written in the matrix form, 
0 io On, XL XL 
=m 
io" On 0 XR XR 


Note that in the massless case, m = 0, the Dirac equation splits into two separate 
equations for each of the components Xz, Xr; massless solutions vy, and x, are 
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eigenfunctions of the helicity operator P TPT Z with eigenvalues —1 and +1, respectively.” 
Therefore, in the massless case the minimum option is to introduce one 2-component 
spinor xz, so that the theory contains only particles of left helicity and antiparticles 
of right helicity. This is precisely the way neutrino is described in the Standard 
Model. Of course, parity is broken in such a situation. 

To describe the Standard Model interactions in terms of the Dirac 4-component 
spinors, it is necessary to extract the components X, and Xr. This is done by using 
projection operators 


1Fy 
hsara Pany 


In the chiral representation 


In what follows we use the notations 


ts 


be =P p= 5y, Yr =P= Fy. (B.4) 


In the chiral representation of the Dirac matrices 


a) (2) 


For some applications a useful observation is that x$ = io2x% is left spinor. 
Problem B.2. Prove the last statement above. 


If not otherwise stated, we use 4-component spinors in what follows. The most 
commonly used Lorentz structures bilinear in fermion fields are 


wy, scalar, wy, vector, 


= _ B.5 
wy, pseudoscalar, Wyy"w, pseudovector, m 


where 


is the Dirac conjugate spinor. 


Problem B.3. Check the validity of the Lorentz assignment (B.5). Express these 
structures in terms of the Weyl fermions. 


5In both massless and massive cases, helicity is the projection of spin onto the direction of motion. 
However, helicity is Lorentz invariant only for massless fermions. 


486 Standard Model of Particle Physics 


Within the Standard Model, neutrinos have only left components, in contrast to 
quarks and charged leptons. With respect to strong interactions, both left and right 
quarks form fundamental representations (triplets), so that from the standpoint of 


strong interactions the separation of quarks into left and right components is not 


required. On the other hand, right quarks and right charged leptons are singlets 


under SU (2)w, while the left fermions form doublets 


o fu fe O t 
Qi= a}? Q2= al Q3 = E 
L L L 


Ly 


II 
A 
`o c 
Naun 
fa 
E 
N 
II 
Yor 
~~ 


Similarly, the convenient notation for the right fermions is 


Un = UR, CR, tR; n=1,2,3; 
Dn = dR, SR, OR; 


En = ER, HR, TR. 


Vr 
’ L3 = 
H L s L 


(B.7) 


Dimensions of the fermion representations and Abelian charges are given in 


Table B.2. 


The scalar Englert-Brout-Higgs field H is singlet under the group of strong 
interactions, doublet under SU (2)w and carries U(1), charge +1. These properties 


are reflected in Table B.1. 


Table B.2. Dimensions of representations and charges of 
fermions of the first generation; fermions of the second 
and third generations have exactly the same quantum 


numbers. 


Field\ Group SU(3)e SU(2)w U(1)y U (Ijem: 


e o 2 a 
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In terms of the fields explicitly covariant under the gauge group SU(3). x 
SU(2)w x U(1)y the Standard Model Lagrangian reads 
1 wo l min l mi 
LSM = =g Gw G = z EVV = qPwB 
+ iLnD”YyLn + iEn D yu En + Qn D YuQn + iUn D” YypUn + iDnD yu Dn 
— (Yl lH En + Yo, QmH Dn + Yuin QmHUn + h.c.) 
vN? 
+ D,H'D"*H — A (mu - >) (B.8) 
Here the first line contains gauge fields, whose strength tensors are 
Buy = O,By — 0 Bis 
Viv = bp Vo — OV, — ig[ Vp, Vo], 
Gav = O.Gy — Gy — igs[Gp, Gr], 


where the square brackets denote commutators; for instance, [V,,, Vy] = Vp Vy — Vy Vp- 
The field B, is real and fields V,,, G, are Hermitean. In terms of real fields Gf, and Vi 
1 


TWV V VV 


1 
TrG ur GH” = =G4, G9” =5 


a 2 Hv ’ 
with 
Gt, = 0,G3 — 8 G4 + gf?’ G>GS, Vi, = O.V3 — OVi + g VIV},  (B.9) 


where f° and e/* are the structure constants of SU(3) and SU(2), respectively. 
(e)* is a completely antisymmetric symbol.) The last terms in (B.9) are responsible 
for gluon self-interactions and for interactions between W+-, Z-bosons and photons. 

The second line in (B.8) includes free Lagrangians of fermions and fermion cou- 
plings to gauge fields. Covariant derivatives entering there are uniquely determined 
by gauge group representations of fermions and their U(1)y-charges: for fermion f 
in representations T, and Tw of SU(3). and SU(2) 


; apma iq $ i ; Y, 
Dif = (a, = igs T; Gu 10T Vy = id LB, ) ue 


where T? and T%, are generators of SU(3). and SU(2)w and Yp is U(1)y-charge 
of this fermion. For quarks T} = \*/2, while for leptons T? = 0. For left doublets 
(B.6) we have TŻ, = r'/2 and for right singlets Ti, = 0. Note that summation over 
generations is assumed in the second line of (B.8). In terms of the fields used in 
(B.8), gauge interactions are diagonal in generations. 

The third line in (B.8) describes the Yukawa interactions of fermions with the 
Englert-Brout—Higgs field H; h.c. means the Hermitean conjugation and summa- 
tion over repeated indices m,n labeling generations is assumed. Yukawa coupling 
matrices Y;!,,,, Y,4,, and Y,% 


mn? mn 


are complex and not diagonal in generations. Below we 
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briefly discuss the consequences of this property. We emphasize that the Lagrangian 
(B.8) does not describe neutrino oscillations: the relevant Yukawa terms are absent. 
We consider in Appendix C extensions of the Standard Model capable of describing 
the phenomenon of neutrino oscillations. 

Let us make a remark concerning the third line in (B.8). Like the complete 
Lagrangian of the Standard Model, this line is invariant under SU (3)e x SU(2)w x 
U(1),. To see this explicitly, we begin with the first term in the third line. Left lepton 
and Englert-Brout—Higgs field are doublets under SU(2)y,, while the right lepton 
is singlet. Therefore, the first term has SU(2)w-structure (L'H)E; it is SU(2)w- 
singlet. According to Table B.2, the total U(1),-charge of the fields in this term is 
zero, which means U(1),-invariance. The situation with the second Yukawa term is 
similar. The last term includes 


He = tag? = eap H* Ë, (B.10) 


where a = 1,2 and €qg is the antisymmetric symbol. The field Ñ is in the fundamen- 
tal representation? of SU (2)w. Therefore, the third Yukawa term is SU (2)w-singlet. 
It is also invariant under U(1)y, this is the reason for using H there rather than 
the field H itself. 

It is important to emphasize that the second and third lines in (B.8) give the 
most general gauge invariant renormalizable Lagrangian for the Standard Model 
fermions. In particular, explicit mass terms of fermions, which would also have the 
Lorentz structure (fz): fr + h.c., are forbidden by the invariance under SU (2)w x 
U(1)y. 

The last line in Eq. (B.8) is the Lagrangian of the Englert-Brout—Higgs field 
itself. According to Table B.1, the covariant derivative of the scalar field is 


Toig 
DH = On — ig Vya E3 By H. 


The scalar potential of the theory — the last term in (B.8) — has minimum at 
nonzero value of the Englert-Brout-Higgs field such that 


2 
v 

HH = —. 
2 


Using gauge invariance, one can show that the Englert-Brout-Higgs vacuum and 
the perturbations of the scalar field about this vacuum, without loss of generality, 


6H* is in anti-fundamental representation. The relation (B.10) gives isomorphism between the 
anti-fundamental and fundamental representations. Such an isomorphism exists for SU (2) but not 
for SU(N) with N > 2. 
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can be written in the form (unitary gauge) 


0 
H(z) = ( 7 9) ‘ (B.11) 
AtA 

Thus, there is only one physical scalar excitation about the Englert-Brout-Higgs 
vacuum. This is the Higgs boson, described by the field h. 

The Englert-Brout-Higgs vacuum breaks the gauge symmetry SU (2)w x U (1)y 
down to U(1)em. Conversion from explicitly SU (2)w x U(1)y-invariant fields to 
physical vector fields in this vacuum is accomplished by the change of variables 
(B.1), (B.3). Indeed, upon this change, free gradient terms of the fields W7, Z, 
and A, maintain canonical form 


= Sanvi — OV)? + (0, By — 3 Bp)? 
i g g A ; 
= -3 (3LWt — ð W ("W~ © = oW- ”) — gZ = gE F”, 
where 
Fv = pAr — WAp, Zyv = OZ — WZ y- (B.12) 


The vacuum expectation value in (B.11) leads to mass terms in quadratic 
Lagrangian about the Englert-Brout—Higgs vacuum, 

(g? +.9?)v 

a oO : 

This is precisely what the Englert-Brout—Higgs mechanism is about. The change of 
variables (B.1), (B.3) is chosen in such a way that these mass terms are diagonal. 
Thus, the masses of W=- and Z-bosons are 


gu vyg? +g? Mw 
Mw = =, Mz = —= f 
2 2 cos Oy, 
The Yukawa couplings are responsible for the fact that quarks and charged 
leptons are massive as well. The masses of fermions are given by 


sage ty, 


V2 
where yf are eigenvalues of the matrix of Yukawa couplings entering (B.8). The 
only fermions that remain massless are neutrinos. (The latter property is in fact a 
shortcoming of the Standard Model.) 

The Lagrangian of the Standard Model in terms of physical fields is obtained by 
substituting (B.1), (B.3) and (B.11) into (B.8) and performing the transformation 
of fermion fields from the original basis in which gauge interactions are diagonal, 
to the basis where diagonal structure is possessed by the fermion mass matrix (and 
Yukawa terms). As we discuss in Sec. B.3, the latter transformation gives rise to 


2,2 
gv = 
DyH'DYH r + 
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the Cabibbo—Kobayashi-Maskawa (CKM) matrix Vn» describing quark mixing in 
weak interactions. Note that we often use the same notation for fermions in the two 
bases. 

It is convenient to present the full Lagrangian of the Standard Model written in 
terms of physical fields as a sum of several terms, 


Lsm = Lecp +4 fe hie + Lyem+Lyweak + Ly + Ly + Ly + Lit. (B.13) 


Here 


1 a a uv ai ek a 
Loon = -73CC MY £ 5 a (iu -ma = ig Gi ) a 


quarks 


is the Lagrangian containing quarks, gluons and their interactions with each other; 
summation runs over all types of quarks. This is the Lagrangian of the theory of 
strong interactions, quantum chromodynamics (QCD). The second term in (B.13) 
is the free Lagrangian of leptons 


L = y In(ty"On — m1, ln + 5 Dni Ouvn- 
n n 


Here n runs over generations, ln = e, y, T. The third and the fourth terms describe 
electromagnetic and weak interactions of quarks and leptons, respectively, 


Lfem =€) qif Ag, 
f 


where 
t 


gg 


is the proton electric charge, so that eq, is the electric charge of fermion f; 


g 5 5 
L fweak = 2/2 Dav =) Wi ln + h.c.) 


e = gsin Îw = (B.14) 


g = 5) yt 
+ ai 2 (Em C — ° )W; Vmndn + h.c.) 


pI fal (+f (1 — +5) 2 in2 0 Z 
2 cos Ow LF (t3(1 — 7°) — 2ay sin” Ow) f Z. (B.15) 


Here the sum over f means summation over all quarks and leptons, if is the weak 


isospin equal to +1/2 for up quarks u, c, t and neutrinos and —1/2 for down quarks d, 
s, b and charged leptons. The first two terms in Lf weak are couplings of leptons and 
quarks to W-bosons (charged currents), the third term is the coupling to Z-boson 
(neutral currents). Note that the emission and absorption of W-boson changes the 
type (flavor) of fermion (and for quarks, generally speaking, the generation number 
as well, thanks to the non-diagonal CKM matrix Vmn), while the interaction with 
Z-boson does not change flavor. 
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Problem B.4. Check that the following relation holds for all fermions, 
qf = > + il. 


The term £y in the Lagrangian (B.13) describes the Yukawa interactions of 
fermions with the Higgs boson, 


Ly =->) “LF pn a=-So Fen, 
p v2 , 


where summation runs over all fermion species except for neutrinos. The Yukawa 
couplings are proportional to the fermion masses; this is of course a reflection of the 
fact that all fermions obtain masses due to interactions with the Englert-Brout— 
Higgs field. 

The term Ly in (B.13) contains the free Lagrangians of photons, W=- and 
Z-bosons and their interaction with each other, 


1 1 M2 
Ly = = FF FM — FZ ZY + -a Zoa" 


iar = ae _ 
a z Wal” t Mọ |W; lg + ge Wr m WtW, J” 
= aa sin Ow + Z"” cosOw)(W,, Wt — Wt W7), (B.16) 
where Fv and Z,,, are defined by Eq. (B.12), and 


Ww = (ô, + ieA, + igcosOwZ,)W, — (uv). (B.17) 
The relation between the electromagnetic coupling e and gauge couplings g, g’ 
is again given by (B.14). Note that the Lagrangian (B.16), as well as the full 
Lagrangian (B.13), is invariant under the unbroken gauge symmetry U(1)em, and, 
according to (B.17), W=-bosons carry electric charge +e. 

The term Ly describes the Englert-Brout—Higgs sector of the Standard Model, 


1 1 A 
a by +222 3_ Ap4 
Ly= 5 Ouhd h ghh Avh qh 


where 
mha = V 2v 


is the Higgs boson mass. 
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Finally, the term £4, stands for the Higgs boson coupling to the massive vector 
bosons, 


g tg? 


int g? 2 


2 2 '2 
g = g +g 
vhZ,Z" + TIW, [+ — Z2". 
To date, all particles of the Standard Model have been observed experimentally. 
The values of the Standard Model parameters are” [1]: 


Me = 0.511 MeV, My = 1.8 — 3.0 MeV, mq = 4.5 — 5.3 MeV, 
my, = 105.7 MeV, Me = 1.25 — 1.30 GeV, ms = 0.09 — 0.10 GeV, 
mr = 1.78 GeV, mM = 172.0 — 174.4 GeV, my = 4.15 — 4.21 GeV, 
Mz = 91.2 GeV, Mw = 80.4 GeV, mp, = 125.1 GeV, 
_ e 1 2 
v = 246.22 GeV, v= 7 D7 sin* 0w = 0.231, 


as(Mz) = 0.118. 


Uncertainties in quark masses (except for t-quark) are predominantly theoretical; 
they are due to the fact that quarks do not exist in free state. 


B.2. Global Symmetries 


In addition to gauge symmetries, there are global Abelian symmetries in the Stan- 
dard Model: the Lagrangian (B.13) is invariant under simultaneous phase rotations 
of all quark fields, 


q> 8q, g> eg (B.18) 


and independently under phase rotations of lepton fields of each generation, 


(Ye, €) + e*(ve,€), (De, 2) ae Da 2), (B.19) 
(Vp, u) = offr (va H), (Diap) = e 8p (Pius ft) (B.20) 
(vr, T) > tr (vr, T), (%,F) > e7 (0,7). (B.21) 


Here 8, Be, Bu and 8- are independent parameters of the transformations. The 
conserved quantum number associated with the symmetry (B.18) is baryon number® 


1 
B= 3 (Na — N3), 


7We omit the details related to the dependence of these parameters on the renormalization scale, 
except for the case of as. 

8Baryon number is conserved in perturbation theory only. Nonperturbative effects violate baryon 
and lepton numbers, but these effects are very small under normal conditions (but not in the early 
Universe, see Chapter 11). 
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where N, and Ng are total numbers of quarks and antiquarks of all types, respec- 
tively. The baryonic charge of quarks by definition is equal to 1/3, and the charge 
of antiquarks is (—1/3). With this assignment, the total baryonic charge of pro- 
ton, which consists of two u-quarks and one d-quark, equals 1. Hence, in terms of 
numbers of baryons and antibaryons we have 


where the summation runs over all types of baryons. A clear manifestation that 
baryon number is conserved in Nature with high precision is proton stability: proton 
is the lightest particle carrying baryon number, so it must be absolutely stable if 
the baryon number conservation is exact. Proton decay has not yet been discovered, 
and the experimental limit on the lifetime is [1] 


Tp > 10%" — 10%“ years, 


depending on the decay mode. 
Symmetries (B.19)-(B.21) are associated with three independently conserved 
lepton numbers (electron, muon and tau) 


Le = (Ne Ni.) (Ne+ Nosh (B.22) 
Ly = (Nu Ny.) (N+ ł N3, ), (B.23) 
L= (Nz NL,) (N; } Nr, ), (B.24) 


where Ne, Nvs, Ne+, Np, are numbers of electrons, electron neutrinos, positrons 
and electron antineutrinos, respectively, and similarly for other generations. 

A manifestation of the lepton quantum number conservation is the absence of 
processes violating lepton numbers, but otherwise allowed. An example of such a 
process is the decay 


H> ey. 


In this process, electron and muon numbers would be violated. The experimental 
bound on its branching ratio is [1] 


Br(u — ey) < 5.7 - 107". 


This is one of the best results showing the conservation of the muon and electron 
numbers. Note that the observed neutrino oscillations show that lepton numbers 
are actually violated in Nature; see Appendix C. 

In many models generalizing the Standard Model, baryon and/or lepton numbers 
are violated, which should lead to new physical phenomena, such as proton decay. 
The search for these phenomena is an important challenge for low-energy particle 
physics experiments. At present, besides neutrino oscillations (see Appendix C), no 
experimental evidence for violation of baryon or lepton numbers has been found. 
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B.3. C-, P-, T-Transformations 


Let us briefly consider discrete transformations: time reversal, 


T-transformation: (2x°,x) ma (—a°,x), 


spatial reflection (parity transformation), 


0 


P-transformation: (a,x) 4 (x? 


’ =x), 
and charge conjugation, 
C-transformation: f(x) = flx). 


P- and T-transformations, together with the proper Lorentz group, form the full 
Lorentz group. 

From the standpoint of scattering processes, time reversal means the replacement 
of initial state by final state and vice versa; spatial reflection implies the inversion 
of spatial momenta of all particles, and charge conjugation interchanges particles 
and antiparticles. In quantum field theory, the CPT-theorem is valid, which states 
that physical processes must be invariant under the joint action of all three trans- 
formations. One of its consequences is the equality of masses and total decay widths 
of particle and its antiparticle. 

Lorentzian scalars, vectors and tensors 
P-transformations. In the latter case they are called pseudoscalars, axial vectors 
and pseudotensors. For instance, scalar (parity-even) and pseudoscalar (parity-odd) 


9 can be even and odd under 


transform under P-transformation as 
P 
olx’, x) —* ¢ (x°,x) = o(2°, =x), 
and 
P 
(2°, x) E o' (1°, x) = —o(2?, =x), 
respectively. P-transformation of vector (parity-even) reads 
V, (2°, x) => V! (2°, x) = 8V (2, —x) — SLV; (2°, —x), 
while the P-transformation of axial vector is 
A, (2°, x) => Al (2°, x) = —8? Ao (2°, —x) + 54.A;(x°, —x), 
Several spinor bilinear combinations of definite parity are given in (B.5). 
It is clear from the form of the Lagrangian (B.13), (B.15), that weak interac- 
tions violate parity: weak bosons couple to both vector currents Wm7"Wn, and axial 
currents qm y"7>un. Moreover, weak interactions violate CP. The source of CP- 


violation is a complex parameter in the Cabibbo—Kobayashi-Maskawa matrix. On 
the contrary, strong and electromagnetic interactions violate neither P nor C. 


°We do not discuss here the properties of spinors under parity. 
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To understand how the Cabibbo—Kobayashi-Maskawa (CKM) matrix emerges, let 
us consider the transformation from the gauge basis of fermions to the mass basis. 
The Lagrangian (B.8) is written in terms of fields in gauge basis: all gauge interac- 
tions have diagonal form. As a result of spontaneous electroweak symmetry break- 
ing, the Yukawa terms in the Lagrangian (B.8) in the unitary gauge (B.11) give rise 
to fermion mass terms, 


Lm = — Yin Bim Ee z BY inden dry _ BY nil tnn + h.c., (B.25) 


as well as Yukawa couplings to the Higgs boson, 
h h - h 

Ly =Y ernir — indy, dng — HY tru ün +e B.26 

Y A tm CR — Ty tm Gn — Ty LmURn ( ) 


The gauge interactions with gluons, photons and Z-bosons are still diagonal in 
fermions, while interactions with W~-bosons are diagonal in generations, 


Ly = a Wa len + tt, "Wer di, HRe 


v2 v2 


The Yukawa couplings can be written in the form 


Pem pe Maal ne kn a 


where constants Yi, y and Y; are real, and U®+,..., UY“? are unitary matrices; 
summation over repeated index p is assumed. 
Problem B.5. Prove the last statement. 


It is precisely these matrices that transform the quark fields into the mass basis, 


= Ip 7 — zrdr J _ TTURA 
Lam i U mnlRna» drm = Crane tin, URm = U mnr 


=T] — rd J — JUL. 
lim B Unilin dim = Calne ULm = Cin: 


Indeed, in terms of fields with tilde, the terms (B.25) and (B.26) are diagonal; for 
example 


Upon this transformation, kinetic terms of the fermion fields and also gauge inter- 
actions with gluons, photons and Z-boson remain diagonal, while the interaction 
with the W~-bosons contains mixing matrices of generations. 
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Within the Standard Model, mixing in the lepton sector is unphysical: it can be 

eliminated by redefining the neutrino fields, 
Um > Um: Um = (UE) ph a 

Since all gauge interactions of neutrinos are proportional to unit matrix (in the 
space of generations), this redefinition does not give rise to mixing in the neutrino 
sector. The reason for the absence of mixing in the lepton sector of the Standard 
Model is that neutrinos are massless there. In reality, neutrinos are massive, and 
there is mixing between them. This is discussed in Appendix C. 

In the quark sector the situation is different, and the transformation to the mass 
basis leads to mixing between quarks of different generations in the interaction 
vertices with W+-bosons, 


Lw = I arny Wt Vinndin + Fallin Wa Vatn (B.27) 


v2 v2 


where Vi; = (U%);,'U = is the unitary 3 x 3 Cabibbo—Kobayashi—Maskawa mixing 
matrix. 

A 3 x 3 unitary matrix of general form is specified by 9 real parameters (the 
dimension of the group of unitary matrices U(3)). In terms of the orthogonal sub- 
group SO(3) C U(3) these parameters can be divided into 3 rotation angles and 
6 phases. However, five of the six phases are unphysical. They can be eliminated from 
(B.27) by redefining the fermion fields fa —> fret”. This phase rotation does not 
lead to complex coefficients in other parts of the Lagrangian (B.13), since fermion 
fields enter all its terms, except for (B.27), in combinations explicitly invariant under 
phase rotations, such as Fna fn. One remaining phase in the CKM matrix is the 
source of CP-violation in weak interactions (see below). 

In the standard parameterization [1] in which the matrix V reads 


C12C13 $12C13 sige “28 
= id id 
V = | —812€23 — €12893813e"° C12C23 — $12823813e"°8 523¢13 |, (B.28) 
. Bs 
812823 — €12€93813e"°8 —C12823 — $12C238130'°8 ~—€93€13 


where cij = cos bij, Sij = sin6;;, the parameters 0;; are mixing angles, and 613 is 
the CP-violating phase. We show in the end of this Section that the matrix V can 
indeed be chosen in this form. The values of the parameters have been determined 
with good accuracy from numerous experiments [1]: 


$12 = 0,2254, 513 = 0,0035, 
s23 = 0,04118, 513 = 69° + 5°. 
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The absolute values of the elements of the quark mixing matrix are [1] 


0.9743 0.225 0.0036 
IV] = | 0.226 0.9734 0.041 |, (B.29) 
0.0087 0.041 0.9991 


with experimental errors of the order of the last digit. Note that CP-violating phase 
of the CKM matrix is large. Nevertheless, the resulting CP-violating effects in the 
Standard Model are strongly suppressed due to the smallness of mixing angles, 
which is clearly seen in the standard parameterization (B.28). Note also that there 
is strong hierarchy between the entries in (B.29): the diagonal elements are close to 
1, while off-diagonal elements obey Vis, V31 < V23, V32 < Vi2, V21 < 1. 

The general fact that complex entries in Vmn give rise to C'P-violation, follows 
from the transformation law of fields under CP, 


P - P a17 

pr — MPY, Pe — N0, (B.30) 

b, D CPT, tS p(T, (B.31) 

where 77 is the sign factor (7? = +1), and C is the charge conjugation matrix such 


that pyty as —wy"w. For the bilinear combinations of spinors which form the 
charged current coupled to W-boson (see (B.27)), the above formulas yield a simple 
transformation law (we no longer write tilde over fields in the mass basis), 


= 0 CP > 0 = i GP = i 
UtmY Vinnd,, => =d, Y VmnUrm, ULm Y Vinnadrn — drn Y Vmnütm: 


Taking into account the transformation law of W-bosons under CP-conjugation 


we 2 wt, we 2 we, 


t 


we finally obtain that under CP-transformation, the interaction (B.27) converts into 


LGP = yen Wy Vanir + ain Wi Vindi, (B.32) 
Comparison of (B.32) with (B.27) shows that the coupling (B.27) violates CP if the 
entries of mixing matrix Vmn are complex-valued. 

This result reflects the general property of field theory: the initial Lagrangian 
transforms under CP into the Lagrangian with complex-conjugate couplings. If there 
are unremovable phases in the set of couplings, then CP is broken. 

As an illustration, let us estimate the difference of partial widths of decays 


t—=Wtb and t= W7b. 


These widths would be equal if CP were exact symmetry of the Standard Model. 
(CPT ensures that the total widths of particles and antiparticles are the same, but 
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Fig. B.1. Tree-level and one-loop diagrams giving the main contribution to the difference of 
partial widths of t — Wt+b and E — W~b in the Standard Model. The vertices are proportional 
to the elements of the CKM matrix which are explicitly indicated. 


does not forbid different partial widths.) At the tree level, the partial widths are 
identical, and the main contribution to their difference comes from the interference 
of the tree and one-loop diagrams. For the decay t ~ W*b, these diagrams are 
shown in Fig. B.1. 

Similar diagrams contribute to the decay t + W~b. 

If one denotes by V33Mt;ce the tree-level contribution to the amplitude of the 
decay t — W*b, then the one-loop term is 


2 


3 
Mtree : = 5 Van Vin Vis (Anı + in Bni), 


n,l=1 


where Anı, By, are real functions depending on masses of W-boson, t- and b-quarks, 
and also on masses of virtual quarks in the loop. In what follows we need only the 
function Banı, which is conveniently written in the form 


where 0(z) is the usual step-function. The imaginary part Bı of the diagrams 
is related to the kinematics of virtual processes, rather than complex-valuedness of 
couplings: it arises from the integration over virtual momenta near mass shell. Since 
this is a kinematic effect, it is the same for decays of t and t. Hence, the one-loop 
diagram for t > W~b gives 


2 


3 
* g x ž . 
Miree : I6n? 2 Ven Vin V13 (Anı + in Bni). 


The decay widths are determined by the absolute values of the amplitudes squared, 


|M (t > Wt)? = |Miree!? (B.33) 
g 3 
x L+ eza | Vas DO VanVinVis(Ani + inBn) + hc. | +O(g*) p, (B-34) 


In=1 
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|M (t = W~d)|? = |Miree|* (B.35) 
g? 3 
TARN F Va 2 Vè Vin V (Anı + inBnı) + h.c. | + O(g*) $. (B-36) 


In this way, we finally obtain to the leading order in g?: 


I(t - Wtb) —T(é- Wb) 
I(t — W+b) + T(t W-b) 


Acp = 


2 
= -E YO mV Ven Vin Vis) Bni. 
lyn 


Numerically, the relative difference of widths is tiny, 
Acp = 1.7- 107". 


Therefore, this particular effect is unlikely to be observed experimentally in fore- 
seeable future. We note in this regard that the phase in the CKM matrix leads to a 
number of other CP-violating effects (notably, in processes involving neutral kaons 
and B-mesons), some of which have been observed in experiments. We note also 
that the above mechanism responsible for the difference of partial widths of parti- 
cles and antiparticles is quite generic. In various theories generalizing the Standard 
Model, this mechanism is used to generate the baryon asymmetry (see Chapter 11). 


Let us show that one can make phase rotations of the quark fields in such a way that the 
mixing matrix is cast into the form (B.28). The relevant transformations are 


di, > din, n= 1,2,3, 
(B.37) 


ULn > e tr, m= 1,2,38, 


where Bn and yn are arbitrary real numbers. The only effect of these rotations is that the 
matrix V is multiplied on the right by the diagonal matrix diag(e™ et 2 e’3) and on the 
left by another diagonal matrix diag(e™ t? , e7172 , e7173), 

Let us begin with the two-flavor case, i.e., 2 x 2 matrix V. An arbitrary unitary matrix 
2 x 2 can be written as follows: 


_ nip iTg03 i(71 sin Xx+T2 cos x) 
V=e"e e ; 


where 7; are Pauli matrices. (A general prescription is to write a product of exponentials 
of all generators of the unitary group with arbitrary coefficients, where the generators may 
or may not combine into linear combinations in the exponents.) The first two factors here 
are diagonal, and they can be removed by the rotation (B.37). The remaining matrix is 


ix 
eft sinx+7T2cosx)p __ c se 
= ; y 
—se™’X c 


where s = siny, c = cosy. It can be written in two equivalent forms: 


ex 0 eX 0 1 0 1 g 
-0. and _ |-:O- = Aull (B.38) 
0 1 0 1 0 e ™ 0 ex 
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om (“ `) (B.39) 


is a real matrix. The first and last factors in (B.38) can again be removed by the quark 
phase rotations. Thus, the mixing matrix in the two-flavor case can be chosen real and 
given by (B.39). 

We now turn to the three-flavor case. The matrix V can be written as 


where 


V = oh 19393 @48% U1 5 Ue7U io, (B.40) 
where 


Uis = e1 sin x+A2 cos xX)012 Uss = etl sin +A5 cos €)093 Uey = etre sin €+A7 cos €)013 i 


Here àa, a = 1,...,8 are Gell-Mann matrices. The matrices A3 and As are diagonal, so the 
first three factors in (B.40) are again irrelevant. The remaining Gell-Mann matrices are 


T2 0 1 0 
À1,2 = » A= i 
0 1 0 T1,2 


0 0 1 0 0 —i 
AX =]0 0 OJ, Av=]O 0 0 
1 0 0 i 0 0 
In analogy to the two-flavor case, we write 
Ui2 = diag(1,e**, 1) - O12 - diag(1, e’*, 1), (B.41a) 
Uss = diag(1,1,e “*) - Oas - diag(1, 1, e*), (B.41b) 
Usr = diag(e“*, 1, 1) - Oo7 - diag(e™*®, 1, 1), (B.41c) 
where 
C12 sı2 0 1 0 0 C13 0 sig 
O12=|-Si2 ci2 O}, Oss =]0 c3 $23], Os7 = © 1 O07, 
0 0 1 0 —se3 C23 —sı3 0 c13 


and notations coincide with those used in (B.28). The last diagonal factor in (B.41a) and 


first factor in (B.41b) can be removed by the rotations of quarks, so equivalent form of 
V is 


V = O15 - diag(e’s, die) . Osrdiag(e “Se **, 1) - O12. 


We perform additional rotations of quark fields which multiply the matrix V on the left 
and right by diagonal matrices which commute with O45 and O12, respectively, 


diag(e’®,1,1)- V - diag(e?, e’7, e?) 
= 0145: diag( to, Í; e’‘) - O67 ` diag(e07® ; e00 i) Oy. 
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We find by direct computation that 


diag(e ®t 1, ef) - Og7 - diag(e7—®), 07% etf) 
cazet lt) 0 siget ta+8) 
= 0 eilY—-x) 0 
— s1676) 0 ciet Pto) 
We now choose a y x, B ¢ and find that the only remaining phase factor 


multiplies 513, and the phase is ô = £+ a + 8 = —(y + Ç — £). Thus, the CKM matrix can 
be written as 


ii 0 0 C13 0 sige C2 s2 0 
V=ļ|0 c23 s23 0 1 0 —si2 ci2 0f, (B.42) 
0 —se23 C23 —si30e t? 0 C13 0 0 1 


which coincides with (B.28). 


B.5. Effective Fermi Theory 


Processes at low energies, E « Mw, are well described by the effective Fermi theory. 
The Lagrangian of this theory is obtained from (B.13) by integrating out massive 
vector fields. 

In the Standard Model, the interaction of massive vector bosons with fermions 
has the form (see (B.15)) 


g NC CC 
Loe ine z z i Zh JXW 4+ h.c.), 
Tupak =S cosby, ” 7 sah re) 


where the neutral a © and charged IG C currents are 
pe 2 Fru(td (1 — 7°) — 2qp sin? Ow) f (B.43) 


Joo = Ta (1-7 )lm +X tm Yu(1 — 7°)Vinn dn (B.44) 


m,n 


At E < Mz there are only fermions (leptons and quarks, except for heavy t-quark) 
in the initial and final states. At these energies, the main contribution to the weak 
amplitudes comes from single exchange of virtual massive vector boson. In the 
language of Feynman diagrams “integrating out” means shrinking the propagators of 
W- and Z-bosons to a point (see Fig. B.2). As a result, the Standard Model diagrams 
of the type shown in Fig. B.3(a) transform into diagrams shown in Fig. B.3(b). The 
latter correspond to the Fermi theory of effective four-fermion interaction. Thus, 
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Fig. B.2. Point-like approximation to the propagators of massive vector bosons. 
Z,W GF 
— > 
(a) (b) 
Fig. B.3. The relation between diagrams of the Standard Model and of the Fermi theory. 


integrating out the Z-boson gives rise to the interaction 


F INC NC 
L CE; JNO e, 
- v2 i 


while integrating out W~-bosons yields 


GF joc CC } 
Lc = = u d H 
The effective coupling Gr, Fermi constant, has dimension m~?; it determines the 
strength of the four-fermion interaction. This coupling is related to the fundamental 
parameters of the Standard Model as follows (see Fig. B.2) 


g 
CoS — 2 —; 
4V2 M2, 
Its numerical value is 
Grp =1.17-10-° Gev~?. 


Note that similar procedure of integrating out the Higgs boson also leads to effective 
four-fermion interaction of the form WmWm ` YnYn. However, the effective couplings 
here are proportional to the corresponding fermion Yukawa couplings, which are 
small compared to the weak gauge coupling.!° Therefore, the impact of this effective 
interaction on processes at low energies can be neglected. 


B.6. Peculiarities of Strong Interactions 


Although mediators of strong interactions, gluons, are massless, strong interactions 
are also described by effective theory at low energies (more accurately, at small 
momentum transfers Q). In contrast to the Fermi theory, the reason is that QCD is 


10 An exception is the t-quark Yukawa coupling, but t-quark does not participate in low energy 
processes discussed here. 
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in strong coupling regime at energies of order Agcp ~ 200 MeV, and even somewhat 
higher. 

The energy-dependent (“running”) strong gauge coupling as(Q) = g2(Q)/(47) 
increases as energy decreases, and becomes large at Q ~ Agcp. (At the one-loop 
level it tends to infinity as Q — Agcp.) Thus, at energy E ~ Aggp, not only the 
perturbation series in gauge coupling blows up, but the very description in terms 
of quarks and gluons loses any sense. 

This feature of the theory is fully consistent with the fact that quarks and gluons 
are not observed in free state. They are bound inside colorless particles — hadrons 
(mesons and baryons) — and begin to play an independent role only in processes 
with characteristic momentum transfer higher than Agcp. This phenomenon got 
the name confinement of quarks and gluons. It is clear that the characteristic size 
of light hadrons — regions where u-, d- and s-quarks and gluons are confined — is 
precisely Age p: 

At low energies, interactions of the lightest hadrons (protons, neutrons, pions, 
kaons, etc.) between themselves and with leptons and photons are well described 
in the framework of the chiral perturbation theory. To describe heavier hadrons 
and strong processes in the intermediate energy region E 2 Agcp, various phe- 
nomenological approaches are used. A number of variables characterizing QCD in 
the strong coupling regime can be calculated from the first principles by putting the 
theory on the lattice and calculating the functional integral numerically by Monte 
Carlo method. From the point of view of cosmology, of special interest are lattice 
calculations of QCD transition temperature which give Tocp ~ 170 MeV [272]. 


B.7. The Effective Number of Degrees of Freedom 
in the Standard Model 


Using the particle spectrum of the Standard Model and accounting for peculiar- 
ities of strong interactions, it is straightforward to estimate the effective number 
of relativistic degrees of freedom g, as a function of temperature of the primordial 
plasma. The simplest way is to use step-function approximation to the temperature 
evolution of g,(T’) near particle thresholds and QCD transition. The result is shown 
in Fig. B.4. 

At temperature T < 100 MeV, only photons, electrons and neutrinos are rela- 
tivistic, so at 1 MeV < T < 100 MeV the effective number of relativistic degrees of 
freedom is 


7 43 
g«(T £ 100 MeV) = 24 + 5 (4e +3- 2,) = Z = 10.75. 


At T = 100 MeV, additional contribution comes from muon. Above the temperature 
of the QCD transition Tgcp ~ 170 MeV, the plasma contains light quarks (u, d, s) 
and gluons. Lattice calculations show that their interactions do not affect dramati- 
cally the thermodynamic quantities such as energy density or pressure. Therefore, 
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Fig. B.4. The effective number of degrees of freedom in the primordial plasma as a function of 
temperature. Only Standard Model particles are taken into account. 


at T ~ Toon the effective number of relativistic degrees of freedom changes by 
7 
Agee”) ~8-245-3-3-4= 47.5. 


The first term here comes from gluons (massless vectors in eight color states), the 
second one is due to u-, d-, s-quarks and antiquarks, each in three color states. Steps 
at higher temperatures are associated with heavier particles (see the SM spectrum 
given in the end of Sec. B.1), as shown in Fig. B.4. 

At T > 200 GeV the effective number of degrees of freedom, in the framework 
of the Standard Model with one Englert-Brout—Higgs doublet, is 


gx(T Z 200 ) 
7 
= 2,4+2-3y4+32+1,+8 ‘2a + 3(8-4e+3- 2, +6: 39. 4g) (B.45) 
= 106.75. 


Here subscripts indicate types of particles, superscript (c) refers to the number of 
color states, the last factors are the numbers of spin states. Note that in the above 
calculation, three polarizations of W- and Z-bosons and one degree of freedom of 
the Higgs boson are taken into account. This is appropriate for the Standard Model 
in the Englert-Brout—Higgs phase. In the phase of unbroken electroweak symmetry, 
W- and Z-bosons are massless and have 2 polarizations each, while the Higgs field — 
complex doublet — describes 4-scalar particles. The number of degrees of freedom 
is the same in these two phases, so the result (B.45) is valid in any case. 


Appendix C 


Neutrino Oscillations 


Neutrino oscillations — transitions that change neutrino flavor — is a unique to date 
direct evidence for the incompleteness of the Standard Model of particle physics, 
obtained in laboratory experiments rather than from cosmology or astrophysical 
observations. Neutrino oscillations occur if neutrinos are massive and there is mix- 
ing between lepton generations analogous to quark mixing considered in Sec. B.4. 
Within the Standard Model, the Lagrangian cannot contain renormalizable gauge- 
invariant terms which would lead to neutrino masses. In this sense, neutrino oscil- 
lations are phenomenon beyond the Standard Model. 

Historically, the first data pointed at neutrino oscillations were obtained in mea- 
surements of solar and atmospheric neutrino fluxes. These discoveries have been 
confirmed afterwards by experiments with neutrinos from nuclear reactors and 
accelerators. 


C.1. Oscillations and Mixing 


In this Section we discuss in general terms the mechanism leading to neutrino oscil- 
lations. Here we set aside the aspects related to the fact that neutrino has spin 1/2. 
These aspects are discussed in Sec. C.4. 


C.1.1. Vacuum oscillations 


In extensions of the Standard Model allowing for nonzero neutrino masses, there can 
occur oscillations between neutrinos of different types (flavors). Namely, neutrinos 
are produced in weak processes in full compliance with the Standard Model.! As an 
example, neutron beta-decay n — pe* ve produces electron neutrino, muon neutrino 
is created together with muon in the decay 7* — u* vp, while hadronic decays 
of 7~-lepton yield t-neutrino. However, in the basis ve, Vu, Vr, the Hamiltonian 
describing neutrino free propagation is non-diagonal, which leads to oscillations. 


lIn this Appendix we are interested in processes at not too high energies, for which this is certainly 
the case. 
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We call this flavor (or gauge) basis, since gauge interactions of leptons are flavor- 
diagonal in this basis. 

The situation here is quite analogous to quark mixing. In the basis where the 
gauge interactions of quarks are diagonal (gauge basis), the quark mass matrix is 
non-diagonal. Vice versa, in the basis where quark mass matrix is diagonal (mass 
basis) gauge couplings are non-diagonal. In the case of quarks it is convenient to 
work exclusively in the mass basis. When studying neutrino oscillations, one uses 
both gauge and mass basis. 

We assume here that there are three types of neutrinos in Nature. Since the 
creation and detection of neutrino occur via weak interactions, it is the states of 
the flavor basis |ve), |v,,) and |v,) that are observable. The basis vectors of the 
flavor basis |Va), a = e, u, T are related to the basis vectors of the mass basis |1;), 
i = 1,2,3 by unitary transformation traditionally written in the form 


vi) = UailYa)- (C.1) 


(Summation over repeated indices is assumed.) Here the unitary matrix U,; is the 
mixing matrix in the neutrino sector, dubbed Pontecorvo—Maki-Nakagawa-Sakata 
(PMNS) matrix. The states |v;) are eigenstates of the free Hamiltonian, i.e., they 
have definite masses m;. The inverse transformation from the mass basis to the 
flavor basis reads 


Ya) = (U")ialvi) = Užili). (C.2) 


With the definition (C.1), the neutrino mass matrix in the flavor basis has simple 
form, 


Mag = Va|M|vg) = (UM ™U") aa, (C.3) 
where M(™ is diagonal mass matrix in the mass basis, 


The neutrino free evolution in the rest frame is determined by eigenvalues of the 
mass matrix, 


|v; (t)) =e" "|v; (0). (C.5) 


Suppose that at time t = 0 there is a pure flavor state — for instance, electron 
neutrino |v.) produced in the decay of (possibly virtual) W+. Then at time t, other 
components of the state vector in the flavor basis also become nonzero. This implies 
a non-vanishing probability of detecting muon or 7-neutrino at that time. 

For practical applications, one needs to calculate the transition probability Va —> 
vg in the laboratory frame at a distance L from the source of neutrino va. Then 
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the formula (C.5) for neutrino evolution in the mass basis has to be generalized, 
[vy (t, L)) =e EPH) |y5(0)), 


where p; and FE; are neutrino momentum and energy. Realistically, neutrinos are 
ultra-relativistic. Let us assume that neutrino energy is fixed,” and write in the ultra- 


relativistic case pj = 4/ E? — m? = E — m4 /2E. Omitting phase factor common to 
all neutrino species, we find that the evolution of states in the mass basis in terms 
of the traveled distance is given by 


m2 
\v;(L)) =e" *|v;(0)). 
Note that this evolution corresponds to the effective Hamiltonian 


M2 


ee 


(C.6) 


where M is the neutrino mass matrix, which takes the forms (C.3) and (C.4) in the 
mass and flavor basis, respectively. As follows from (C.1), the transition amplitude 
of neutrino Va to neutrino vg is equal to 


A(a = B) = $ (velvi (L) (v (0)lva) = S vplrjet#* (v5|¥a) 


J j 
m? 
= X_ Upe ŻE UX]. (C.7) 
J 


This formula enables one to calculate the probability of transition between two 
states of the flavor basis after traveling the distance L: 


P(¥q > vp) = |A(a > B))? 


= bap — 49 [Re[U3jU aj VaiU Gi] si | ae 
jou 
+2) Im[U3 Uj ai5il sin | 3-2 (C.8) 
jou 
where 
Am‘; = m? — m? 


2There is a discussion in literature of delicate issues like “are neutrino states eigenstates of the 
energy operator Po or the momentum operator Ê ?” The answers to questions of this sort are 
important for correct description of the oscillations of not too fast neutrino, as well as for studying 
the applicability limits of the oscillation picture. (It is obviously not valid at large distances from 
the source where neutrinos of different masses come in substantially different times.) We do not 
enter this discussion here. 
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The expression (C.8) describes oscillations with amplitude determined by the neu- 
trino mixing matrix and the oscillation lengths depending on the difference of neu- 
trino masses squared and energy. Note that in realistic situations the oscillation 
pattern may be washed out if the source has large spatial size and/or averaging is 
performed over a certain interval of neutrino energies. 

It is clear that along with the oscillations of neutrinos there should be oscilla- 
tions of antineutrinos. The latter have indeed been observed experimentally. Anti- 
neutrino oscillations are also described within the above formalism. CPT-theorem 
(see Sec. B.3) gives the following relation between the probabilities of neutrino and 
antineutrino transitions, 


P(Vq > vg) = P(g > Da). (C.9) 


The transition probability of vg — Va is equal to the transition probability of 
Va — Vg calculated with complex conjugate neutrino mixing matrix (see (C.8)). 
Hence, the relation (C.9) leads to equality 


P(Da > Dg ; U) = P(va > vg 3 U"). 


This equality implies that neutrino and antineutrino oscillation probabilities may 
be different only if the matrix U is complex (see Eq. (C.8)). This would mean CP- 
violation in lepton sector. Note that non-trivial CP-phase is possible only if the 
number of neutrino species exceeds two: in the case of two types of neutrinos, the 
mixing matrix U can be made real by redefining the fields (see below). 

An important example is the case of oscillations between two types of neutrinos. 
In this case 2 x 2 unitary matrix Uai, i,@ = 1,2 is determined by 4 real parameters. 
Namely, we have shown in the end of Section B.4 that any unitary matrix 2 x 2 can 
be written in the form U = D1ODz, where D1 2 = diag(e’*!?, e’X1.2) are diagonal 
matrices of phase rotations (one of the phases can be set equal to zero), and O isa 
real orthogonal matrix. The formula (C.8) shows that matrices D12 are irrelevant 
for oscillations, so the transition from flavor to mass basis is effectively described 


by the matrix 
cos@ sin 
Vai = : (C.10) 
—sinf cosé 


which depends on one parameter, mixing angle. 
In the case of two-neutrino oscillations, the formula (C.8) gets simplified, 


2 
P(va > vg) = bag + (—1)**4 sin? 20 sin? (= L). (C.11) 


In other words, the probability of transition of neutrino Va to another type of 
neutrino vg is equal to 


A 2 
P(¥q > Ve¢a) = sin? 20 - sin? ( TF 2) , (C.12) 
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while the survival probability of neutrino vq is 


A 2 
P(va > Va) = 1 — P(Va > Ve¢o) = 1 — sin? 20 - sin? ( m r). (C.13) 


The mixing angle @ determines the oscillation amplitude, A = sin? 20. The oscilla- 
tion length is 


An FE E ev? 


TO gs EY” a ae 


(C.14) 


At this distance the neutrino v, returns to its original state, while the maxima of 
the oscillation probability are at distances Lk = Lose(1/2 +k), k = 0,1,2,.... 
C.1.2. Three-neutrino oscillations in special cases 


In the three-flavor case the mixing matrix can be cast into the form (see end of 
Sec. B.4) 


UPMNS — D U Do, (C.15) 


where Dj are again diagonal matrices of phase rotations, Dı = diag(e’™, ec’, 
e23), Do = diag(e’®1, e’2 etfs), and matrix U has the same form as the matrix V 
in (B.28), (B.42): 


C12 $12 0 C13 0 siget? iL 0 0 
U = —S12 C12 0 0 1 0 0 C23 $23 
0 0 1 —sy3e7%° 0 C13 0 —523 C23 
€12C13 812C13 s1367" 

= | —512C23 — 128238136" C1223 — $12823813e"°  s23C13 |. (C.16) 
$12523 — €12€23813e —€12523 — 812238130 €23C13 


Some phases in Dj,2 are physical (see Sec. C.4), but just like in the two-flavor case, 
the formula (C.8) shows that these phases are irrelevant for oscillations. Thus, in 
the three-flavor case the oscillations are described by three mixing angles and one 
phase which enter (C.16). With slight abuse of terminology, we call the matrix U 
of the type (C.16) as PMNS matrix. 

As we discuss later, there is a hierarchy between the differences of neutrino 
masses squared, 


|Am3,| > Ama). (C.17) 
This implies, in particular, that 


jamal = [Ams = Amd, | ~ [Amd | > Amd. 
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Fig. C.1. Normal (a) and inverted (b) hierarchies of neutrino masses. 


Hereafter we stick to the following convention concerning the numbering of the mass 
eigenstates: the mass mg differs significantly from mı and mg; the masses mı and 
mz are close to each other, and 


mz > mı. 


The property (C.17) suggests either normal or inverted mass hierarchy, as illustrated 
in Fig. C.1. Let us show that due to this property, the formulas describing the 
oscillations between the three types of neutrinos, are similar in two special cases to 
those of two-neutrino oscillations. These cases are, in fact, of great interest, since 
the first of them often occurs for accelerator and reactor neutrinos, while the second 
one is relevant for solar neutrinos. 

Let us begin with the case when the energy E and the distance between the 
neutrino production and detection are such that 


Am3, 
——L al. C.18 
2E = ( ) 


Then the first oscillating term in (C.8) is expressed as 


* * PR. Am‘; 
XO Už; UsjUaiU}i + sin T 


j>i 
= UžUg3(Ua1U 51 + Ua2U $2) > sin? (Aur). (C.19) 

Let us now take into account the unitarity condition 
(UU )ag = X Vail ii = Sap (C.20) 


and write the expression (C.19) in the form 


. Am? 
U33U 3(5aa — Ua3U ġa) > sin” (Az). (C.21) 
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The latter formula is valid for all a and 8 and contains only |Ug3|? and |Ugs|?. 
Similar calculation shows that the last term in (C.8) is equal to zero. These results 
imply, in particular, that CP- violating effects are strongly suppressed in the regime 
(C.18). (They vanish in the limit Ama, L — 0.) The regime (C.18) is indeed realized, 
as a rule, in terrestrial es because of this fact, and also because of the 


smallness of the angle 13 (see below), the observation of CP-violation in neutrino 
oscillations is extremely difficult. 
Equations (C.8) and (C.21) give for the survival probability 


A 2 
P(Vq — va) = 1 — sin? 29eg sin? ( ——241 J, (C.22) 
4E 
where by definition 
sin? bep = |Uas|?. 


The formula (C.22) is analogous to the expression (C.13) valid for two-neutrino 
oscillations. The probability of appearance of neutrino of type 8 4 a is 


2 2 a2 Am; 
P(va > vg) = 4|Ug3| |Ua3|" sin TE L (C.23a) 
2 A 2 
= tne sin? (20¢f) sin? (As) . (C.23b) 


It differs from (C.12) by the first factor which accounts for admixture of two states 
with @ # a in the mass eigenstate v3. 

We now turn to the second special case. It occurs when the neutrino production 
region is sufficiently large and/or neutrino have fairly large energy spread, so that 
the oscillations with phases proportional to Am3, and Am3, get averaged, 


Amz 1 Amz 
2 31 A . 31 = 
(sin ( 7 2)) =y (sin ( 5 t)) 0. 


The quantity of interest for applications is the survival probability P(va —> va). 
Again using the unitarity condition (C.20), it can be represented as 


2 2 2 22 (Ams; 
P(va > Va) = 1 — 2|Ua3|"(1 — |Ua3|*) — 4|Ua2|"|Uo1|* sin EE : 
We introduce the mixing angle Ou by the relations 


[Ua]? = [Ua]? sin? 6! |Vo2|? 


2 Q/ 
cos“ 0 ==; n n E Á = —;s n 7r. 
ff Bafta, 1-WUaf foal? Uar 


As a result, we obtain finally 


A 2 
P(va > va) = \Uas|* + (1 = Ua’)? 1 — sin? 20y sin? ( m1) |, (C.24) 
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In the case of small admixture of neutrino va in the mass eigenstate v3, i.e., when 
|Ua3|? <1, the latter formula also transforms into (C.13). 


C.1.3. Mikheev—Smirnov-— Wolfenstein effect 


The formulas presented so far are valid for vacuum neutrino oscillations, and, more 
generally, when the influence of medium where neutrinos propagate is negligible. 
However, matter does affect neutrino properties in some situations. The correspond- 
ing phenomenon is called Mikheev-Smirnov—Wolfenstein effect [273, 274] (MSW); 
it is due to coherent forward neutrino scattering off electrons present in matter. 

MSW effect is accounted for by introducing an effective term in the Hamiltonian 
describing neutrino propagation. Recall that at relatively low energies, charged cur- 
rent interactions are described by the Fermi theory whose Lagrangian in the lepton 
sector is 


G 
Lee = aren —VPy)e- ey (1 — 7° ve = —2V 2G rey" e + EYuVe (C.25) 


(see Sec. B.5), where in the last equality we made use of the fact that neutrinos are 
left fermions, so that (1 —~7°)ve = Ve. For matter with electron number density ne 
we have 


((€xvere1)) = (ele) = ne, (C.26) 


where double brackets denote matter average and k, l are spinor indices. Assuming 
that electric currents are negligible (this is certainly the case for non-relativistic 
matter), we write 


((ExVir€1)) = 0. (C.27) 
Given that the operators é, and e; anticommute, we obtain from (C.26) and (C.27) 


_ 1 
((ekē1)) = ri Ne 


(in the representation where 7° is symmetric). Averaging the Lagrangian (C.25) 
over matter, we obtain the contribution to the effective Lagrangian describing the 
propagation of electron neutrino, 


1 
Leff = —2V2G piey" ((e@)) "Ve = 2V2G PN THe Yuve 
= ~V2G pnevey Ve. 


Hence, we conclude that matter effect is accounted for by the following substitution 
in the Dirac operator iy"O,, 


iy? — iy? — V2Grney, 
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i.e., the operator io is replaced by 
iðo — V, 
where 
V =V2Grne (C.28) 


is the matter contribution to the effective Hamiltonian. We emphasize that this 
contribution exists for electron neutrino only, as muons and 7-leptons are absent in 
matter. 

The last statement is not entirely correct. The effective four-fermion Lagrangian 
has the neutral current terms. The relevant structure is (see Sec. B.5) 


5 eye: DaYpVas 
a 

where summation runs over all neutrino types. These terms lead to new contribu- 
tion to the neutrino effective Hamiltonian which has the form analogous to (C.28). 
However, this contribution is now the same for all types of neutrinos. The latter 
property means that in the basis |va}) as well as in any other basis, this contribution 
is proportional to the unit matrix. Hence, it does not affect neutrino oscillations, 
leading only to additional time-dependent overall phase in the state vector. There 
is no need to consider this contribution in what follows. 

Thus, the effective Hamiltonian describing neutrino propagation in matter is 
different from (C.6), 


M? g 
Heg(L) = => + V(L). (C.29) 
2E 
The effective potential operator V has the only nonzero matrix element in the flavor 
basis, 
V(L)ap = V(L)beades; 
where V(L) = V2Grn.(L) and ne(L) is the electron density at distance L from 
neutrino source. 

The matter effect on neutrino propagation leads to a number of important and 
interesting phenomena. One of them is discussed in Sec. C.2.1. Here we make one 
simple observation. Namely, even in the two-neutrino case, the oscillation probabil- 
ities for neutrino and antineutrino are different in matter. The physical reason is 
that there are electrons in matter and no positrons. Hence, the presence of matter 
explicitly violates CP. At more formal level, the CP-transformation converts the 
electron density operator e into (—fie), so the matter contribution in the effective 
Hamiltonian for antineutrino differs by sign from (C.28). This fact is responsible, 
in particular, for additional difficulty in search for CP-violation in neutrino oscil- 
lations: neutrino beams produced by accelerators will pass through the Earth, and 
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only then will be detected, so one will have to discriminate between the “true” 
CP-violation (arising due to the phase in the PMNS matrix) and the matter effect. 
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C.2.1. Solar neutrinos and KamLAND 


At the Earth, the major contribution to the cosmic neutrino flux comes from ther- 
monuclear reactions in the center of the Sun, which are the source of solar energy. 
We list here the main reactions leading to the neutrino emission by the Sun, 


ptp—*H+et + rve (C.30) 
p+e+p—>’H+ nre (C.31) 
3He + p > tHe + et + ve (C.32) 
"Be +e” > "Li+ ve (C.33) 
8B — Be +e" + rve (C.34) 

By BC pet ap, 
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Only electron neutrinos are produced in these reactions. The energies of these neu- 
trinos range from zero to tens MeV; the energy spectrum of solar neutrinos is shown 
in Fig. C.2. Neutrino flux and spectrum are reliably calculated within the Standard 
Solar Model [275, 276] (SSM). 

Low-energy neutrinos interact extremely weakly with matter; they pass through 
the Sun and Earth with virtually no absorption or scattering. Despite the large 
neutrino flux, their detection is very difficult. The number of events per unit mass 
of a detector is small, so one has to install massive detectors (tens of tons to several 
tens of kilotons, depending on the type of detector), collect statistics for many years, 
and reduce the background by using radioactively pure materials, placing detectors 
deep underground (where the cosmic ray flux is substantially reduced), etc. 

Historically, the first experiment designed to measure the solar neutrino flux 
was built in the Homestake mine [277] (USA). It lasted for almost 30 years. Solar 
neutrinos were captured in the reaction 


FOl+y,— *Ar+e-. (C.35) 


A handful of 37Ar atoms were chemically extracted on regular basis from the 
615 tons target, and then their number was determined by counting the decays 
of radioactive 3’ Ar. Experiments of this type are called radiochemical. They mea- 
sure the integrated neutrino flux, weighted by the energy-dependent capture cross- 
section. The Homestake experiment was sensitive mainly to the boron neutrinos 
produced in reaction (C.34), with substantial contribution expected from reaction 
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Fig. C.2. Solar neutrino spectrum at the Earth [275] in the absence of oscillations. Contributions 
of various thermonuclear reactions are shown with estimates of accuracy of calculations. Neutrino 
flux from continuum is given in units cm~?s~+MeV~!. Energy ranges accessible for various solar 
neutrino detectors are indicated in the upper part. 


(C.33) and other reactions (but not (C.30)). The measured integrated flux of elec- 
tron neutrinos F! turned out to be smaller than the calculated SSM flux ®§2),, 


—— = 0.34 + 0.05. (C.36) 


This result was the first indication that on the way from the center of the Sun to 
the Earth, electron neutrino transforms into neutrinos of other types, which do not 
participate in the reaction (C.35). 

Boron neutrino flux in the high-energy part of the spectrum was then mea- 
sured by the Kamiokande detector [278] (Kamioka mine, Japan?), neutrino ener- 
gies E,, > 7 MeV, and later by Super-K [279] neutrino energies E,, > 5.5 MeV 
and Ep, > 5.0 MeV at different stages of the experiments. These detectors used 
water as detector material; target masses were about 1 kiloton and 22.5 kilotons for 


3Hereafter we indicate only the geographical position of the detector. The experiments are per- 
formed by collaborations of scientists from various countries; the lists of collaboration members 
can be found in the original literature. 
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Fig. C.3. Neutrino-electron elastic scattering via exchange of W-boson (a) and Z-boson (b). 


Kamiokande and Super-K, respectively. Neutrino participates in the elastic scatter- 
ing reaction 


v+e >vte, (C.37) 


which results in the production of relativistic electron, whose Cherenkov radiation 
was detected. The measured flux of solar neutrinos was again lower than the SSM 
prediction (Super-K data), 


os-k 


=. 0.41 + 0.07. (C.38) 
SSM 

Note that the error here (and to a lesser extent, in (C.36)) is mainly due to SSM 
uncertainties; the flux PSTK itself is measured with much better precision. 

Elastic scattering off electron is experienced by both electron neutrino and vp, 
v, (see details in Appendix B, in particular, the Lagrangian (B.15) and discussion 
in Sec. B.5). vee~ scattering occurs through the exchange of W-boson (charged 
currents, Fig. C.3(a)) and Z-boson (neutral currents, Fig. C.3(b)). On the other 
hand, only Z-boson exchange contributes to v,e and ve scattering (Fig. C.3(b)). 
The Z-boson exchange results in smaller scattering amplitude than that due to 
W-boson, so the effective neutrino flux detected in the elastic scattering reaction 
(C.37) is proportional to 


ve x &,, +0.15(8,, +B, ). (C.39) 


This property is important for interpreting the results obtained with the detector 
SNO, which we discuss later on. 
The next solar neutrino experiments were radiochemical; they used the reaction 


TlGa+ve een, 


followed by chemical extraction of “Ge atoms and counting of their radioactive 
decays. These are the SAGE experiment [280] (Baksan Neutrino Observatory, 
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Russia, 60 tons of gallium) and GALLEX/GNO [281] (Gran Sasso Laboratory, 
Italy, 30 tons of gallium). Unlike in other experiments, the largest contribution 
to the measured integrated flux comes from neutrinos produced in reaction (C.30), 
though sizeable contributions are due to neutrinos from the reactions (C.33), (C.34) 
and others. The integrated flux measured in the gallium experiments (duration of 
measurements exceeded 10 years) is also significantly lower than the SSM predic- 
tion (note the consistency between the two results obtained, in fact, using somewhat 
different techniques), 


pea 

=Ga = 0.54 + 0.06 SAGE, 

PSSM 

pea 

=Ga = 0.56 + 0.06 GALLEX/GNO. 
PSSM 


The key role of these data was that they eliminated hypothetical possibility that 
the observed deficit of boron neutrinos results from some error in SSM, i.e., it has 
astrophysical nature. Indeed, in contrast to (C.34), reaction (C.30), most relevant 
for gallium experiments, directly determines energy production in the Sun, so the 
p-p neutrino flux can be deduced in practically model-independent way from the 
well-measured luminosity of the Sun (barring very exotic possibilities). Hence, after 
gallium experiments, neutrino flavor transition became the only explanation of the 
solar neutrino deficit. 

Final argument that directly established the fact of transitions of ve to v, and 
v, on their way from the center of the Sun, came from the SNO detector [282] 
(Sudbury Neutrino Observatory, Canada). This detector used 1 thousand tons of 
heavy water as the detector material.4 Neutrinos were detected in the reaction of 
elastic scattering (C.37) as well as in the reactions 


Ve+*"Hopt+pt+e (CC), (C.40) 
v+’°H—>p+n+v (NC). (C.41) 


Like Kamiokande and Super-K, the SNO detector was sensitive to boron neutrinos 
of energies E, Z 5 MeV. In reaction (C.37), the neutrino flux combination (C.39) 
was measured; the result was in agreement with Super-K (though it had larger 
statistical uncertainty). On the other hand, the charged-current reaction (C.40) 
exists for electron neutrino only, so the electron neutrino flux ®,, was obtained by 
measuring its rate. Finally, the reaction (C.41) is purely neutral-current, so its rate 
is determined by 


Onc = Pr, +P, +DL. (C.42) 


4 At later stages 2 tons of salt were added to increase the sensitivity to neutral currents. 


518 Neutrino Oscillations 


Neutrino fluxes measured in reactions (C.40), (C.41) as compared to the SSM 
predictions are 


—“ _ = 0.30 + 0.05, (C.43) 


— NC = 0.87 + 0.19. (C.44) 

®nc,ssM 
The result (C.44) shows that the Standard Solar Model predicts the emitted flux 
of boron neutrinos correctly, while from (C.43) it follows directly that about 2/3 
of them are converted from ve to Vv, and v, when traveling to the Earth. It is also 
important that the results (C.43) and (C.44) are consistent with (C.38), taking into 
account (C.39). 

In fact, the agreement between the experimental data is even better than it might 
seem from (C.38), (C.43) and (C.44). As we have already noted, large contribution 
to errors in (C.38), (C.43), (C.44) is due to the uncertainty in the SSM calculation. 
By themselves, the experimental data have an error of less than 10%; this is the 
precision they agree among themselves. We emphasize that irrespective of SSM, the 
three measured combinations of fluxes, ®,,, PNO and Pir have two independent 
parameters ®,, and ®,,, + ®,,. 

The fundamental result of electron neutrino oscillations has been confirmed by 
KamLAND experiment [283] (Kamioka mine, Japan). KamLAND detector contains 
1 thousand tons of liquid scintillator and detects antineutrinos produced in nuclear 
reactions at Japanese nuclear power plants. Distances to them range from 70 km 
to 250 km and more, so that the effective baseline is about 180 km, in contrast 
to earlier reactor experiments with much shorter baseline. KamLAND observed 
the deficit of electron antineutrinos as compared to the value calculated under no- 
oscillation hypothesis, 


pkamLAND 
—— = 0.66 + 0.06. 


Pno OSC 
Thus, electron antineutrino of energies Æ ~ 3—6 MeV (the range relevant for Kam- 
LAND) experiences transition into other types already at distance of about 100 km. 
Finally, the flux of monoenergetic neutrinos born in reaction (C.33) was mea- 
sured in Borexino experiment [284] (Gran Sasso). This experiment makes use of 280 
tonns of liquid scintillator. The measured ratio of neutrino flux with energy 862 keV 
to SSM prediction is [296] 


@MPBorexino 


= 0.62 + 0.05, 


@Borexino 


which again implies electron neutrino transition into other neutrino flavors. In fact, 
Borexino is capable of measuring solar neutrinos in a wide range of energies, from 
pp to boron. Thus, different regimes of solar neutrino transitions (see below) are 
probed in one and the same experiment. 
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To describe the solar neutrino data and KamLAND results at the present level of 
experimental accuracy, it is sufficient to use the two-neutrino picture of oscillations 
between electron neutrino ve and some linear combination Ÿ of muon neutrino and 
T-neutrino. This is the second special case studied in Sec. C.1.2: the relevant mass 
squared difference Am? = Am3, is the smallest one, and the PMNS matrix indeed 
has |Ue3|? < 1; see Sec. C.2.3. In the two-neutrino picture, the data are described 
by the following parameters of oscillations (precise ranges are given in Sec. C.3) 

Amy ~ 1074 eV’, (C.45) 


sol — 


bsol ~ 35°. (C.46) 


Notably, of great importance to solar neutrinos is the MSW effect. In the two- 
neutrino approximation, the effective Hamiltonian in the flavor basis (1,7) has the 


Am? —co0s26so1 sin 20s, 1 0 
re) er i bee), ETT) l (C.47) 
4E sin20so1 cos 20s61 0 0 


form 


where so, is the vacuum mixing angle, so that the mass eigenstates are 
\v2) = |ve) sin Aso1 + |Č) cos Osor,  |1) = |Ve) COS Aso1 — |) sin Oso. 


Recall that in vacuum, by definition, the heavier state is |v2). 

The electron density is ne = 6-10?° cm~® in the center of the Sun and decreases 
with the distance from the center. Hence, the estimate for the maximum value of 
the potential V is 


Vmax = V(L = 0) ~ 8- 107” eV. 
This implies that the relation 
Am? 


sol 
=n 7 Vmax 


4E 


holds at E ~ 3 MeV. For a neutrino of significantly lower energies (e.g., pp-neutrino) 
the matter effects are negligible, and one can use (C.24) with |U-3|? <1. At E = 
3 MeV, on the contrary, the effect of solar matter is important. Two-neutrino mixing 
in the latter situation is characterized by effective mixing angle 0m related to the 
vacuum angle fso} as follows: 


sin? 26.01 
sin? 26,01 + (2V(L) E/Am2,, — cos 20501)? 


sol 


sin? 20 mM = 


As an example, consider neutrino produced in 8B decay. Its characteristic energy 
is 4—10 MeV. In this case, the matter term V dominates in the center of the Sun 
over the neutrino mass term in the Hamiltonian. Let |v;(L)) be eigenvectors of the 
matrix (C.47) at given distance L from the solar center, and |12(L)) refers to larger 
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Fig. C.4. Electron neutrino survival probability as function of energy [297]. See Fig. 13.23 for 
color version. 


eigenvalue. Since V > 0, see (C.28), |v2) coincides with |v-) in the center. Thus, the 
decay of 8B produces the eigenstate |v2). 

Let us consider further evolution of this state, making use of the adiabatic 
approximation. Recall that in this approximation, the quantum system is always 
stuck at one and the same energy level. In our case, this means that the neutrino 
is always in the state |v2(L)). So, on the solar surface it is in the state which coin- 
cides with the state |v2) in vacuum. This is still mass eigenstate, therefore it does 
not oscillate during the further propagation in vacuum." Thus, the probability of 
detection of electron neutrino at the Earth is 


P(ve > Ve) = | (velve)|? = sin? bsol- (C.48) 


Equation (C.46) then implies that P(ve —> ve) < 0.5. We emphasize that the 
observed fact that the measured flux of boron ve is smaller than half of the pre- 
dicted one (see (C.43)) is direct evidence for the MSW effect: in the case of vacuum 
oscillations, electron neutrino survival probability averaged over energies cannot be 
less than 50%, once two-neutrino approximation is valid; see Eq. (C.11). The solar 
neutrino transition data in the energy range 0.3—10 MeV, shown in Fig. C.4, nicely 
illustrate the transition from vacuum oscillation regime to transitions dominated by 
dense solar matter. 

We note one feature evident from the above analysis. Consider an unrealistic 
case of small vacuum mixing, | sin @,,1| < 1. In that case, the survival probability 


5Note that in the adiabatic regime, neutrino oscillations never occur: the neutrino is constantly 
in the eigenstate |v2) of the local effective Hamiltonian. 
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c 


(a) 


Fig. C.5. The evolution of levels of neutrino Hamiltonian with distance from the center of the Sun. 
(a) No mixing, (b) Small but non-vanishing mixing. 


(C.48) is low. In the case of vacuum oscillations the situation is opposite: Eq. (C.11) 
gives P,..,, = 1 — O(sin? 20501). Here we have an example of the Mikheev-Smirnov 
resonance that enhances neutrino transition in matter. The resonance picture is as 
follows. In the absence of mixing, vectors |ve) and |”) would be the eigenvectors of 
the operator (C.47) with sin 26,,; = 0. In the center of the Sun and in vacuum, |.) 
corresponds to larger and smaller eigenvalue, respectively. (The assumption that 
the lighter neutrino is mainly |v-) in vacuum is important here.) The evolution of 
levels with L would have the form shown in Fig. C.5(a). If small mixing is switched 
on, the levels no longer intersect, as we know from quantum mechanics, and the 
evolution of the levels becomes as shown in Fig. C.5(b). The heavier neutrino in the 
center of the Sun almost coincides with electron neutrino, while it is almost vy in 
vacuum. In the adiabatic evolution, the transitions from level to level do not occur; 
this explains the low survival probability of electron neutrino for small but finite 
sin 0 sol: 


Problem C.1. Show that for small sin@so1, the eigenvalues of the Hamilto- 
nian (C.47) indeed evolve with L as shown in Fig. C.5(b). 


Problem C.2. Consider the (unrealistic) model of the Sun, in which the density of 
free electrons ne(L) changes linearly with L from its value in the center, 6-10% cm~’, 
to zero at the surface (at Lo = 7-10° km). In what region of parameters Am? and 
sin 6,0, the evolution of the neutrino state with L is adiabatic? Consider separately 
the cases of weak and strong mixing, | sin Oso1| < 1 and | sin Aso1| ~ 1. 


Problem C.3. Find an analog of Eq. (C.48) in the case of oscillations between 
three types of neutrinos, assuming |Ue3|? < 1. 


To end this discussion, we note that the MSW-effect can lead to a number of 
other features in neutrino experiments, such as “day-night” effect, the difference 
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in the measured flux of solar neutrinos at night (when neutrinos pass through the 
Earth) and daytime. 


C.2.2. Atmospheric neutrinos, K2K and MINOS 


Oscillations of muon neutrino were discovered in experiments of another class. 
They first showed up in measurements of atmospheric neutrino flux with 
Kamiokande [285] and Super-K [286, 287]. 

Our Galaxy is filled with cosmic rays — charged particles (protons and nuclei) 
propagating in space. Their interaction with the Earth atmosphere gives rise to 
secondary particles. As a result, large number of particles are produced, among 
which the dominant component is the lightest hadrons, pions (with rather small 
admixture of kaons). Charged pions 7* do not reach the Earth surface and decay 
in the atmosphere, producing muons and muon (anti)neutrinos, 


mt ty, 17 > Dp (C.49) 
If the energy of the primary particle is not too high, muons, in turn, also decay, 
again giving rise to neutrinos: 


a eT VD, jb > e DeVy. (C.50) 


Neutrinos produced in reactions like (C.49), (C.50), are called atmospheric; neutrino 
energies relevant for oscillations range from hundreds MeV to tens GeV. 


Problem C.4. At what energies of primary particle most muons reach the Earth 
surface? Assume that the average multiplicity (number of particles produced in a 
collision) is 10 to 500 at energies of tens GeV to hundreds EeV. Hints: Muons 
practically do not interact with the atmosphere; hadron (including pion) mean free 
path with respect to inelastic scattering is about 10% of atmospheric depth. 


The cosmic ray flux is isotropic, so the flux of atmospheric neutrinos in the 
absence of oscillations should be isotropic as well.° However, the observed flux of 
muon neutrinos and antineutrinos actually depends on the zenith angle (right panel 
of Fig. C.6). This means that muon neutrinos coming from above and flying just a 
few kilometers from the production point to the detector, have no time to oscillate; 
on the other hand, neutrinos coming from below pass through the entire Earth, and 
have time to partly transform into other types of neutrinos. At the same time, the 
effect of oscillations on the electron neutrino flux is small (left panel of Fig. C.6). 
The muon neutrino deficit, together with very small excess of electron neutrinos, 
suggest that there is v, — v; oscillation. This result is confirmed by the entire set 


6For neutrinos with energies of several GeV and above, the isotropy gets lost: the flux has a peak 
in the horizontal direction due to the fact that horizontal muons travel longer in the atmosphere 
and thus have more time to decay. This phenomenon is straightforwardly accounted for. 
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Fig. C.6. Dependence of neutrino fluxes at energies below 1 GeV on zenith angle [287]: the left 
and right panels refer to electron and muon neutrinos, respectively. Solid lines show the prediction 
in the absence of oscillations, while dotted lines are for oscillations with parameters obtained by 
fitting the data. 


of data on atmospheric neutrinos, including the measurements of absolute fluxes of 
Ve, Ve and Vu, Du, neutrino fluxes at energies above 1 GeV, etc. 

Muon neutrino oscillations were confirmed by the K2K experiment [288]. Here 
the source of muon neutrinos are pions produced by proton beam from the accel- 
erator of KEK Laboratory in Japan and they decay according to (C.49). These 
neutrinos are detected by Super-K. The distance from production to detection is 
250 km (the distance between the KEK Laboratory and Kamioka mine), and the 
neutrino energy is 0.5—3 GeV. K2K experiment found disappearance of muon neu- 
trinos: their flux at Super-K is smaller than the flux mesured by the “near” neutrino 
detector located directly at the KEK Laboratory. The results of K2K experiment 
are in good agreement with atmospheric neutrino data. 

Study of muon neutrino oscillations is the purpose of two other experiments with 
accelerator neutrinos. In one of them, muon neutrinos and antineutrinos produced at 
Fermilab accelerator (Batavia, USA) are detected by MINOS detector (Minnesota, 
USA) at distance 735 km. Another experiment T2K makes use of muon neutri- 
nos produced at JPARC accelerator complex (Tokai, Japan) and detects them by 
Super-K at distance 295 km. A peculiarity of T2K is that the axis of the neutrino 
beam, wich spreads over fairly large solid angle, is off the direction towards the 
detector by angle 2.5°. This leads to narrower energy distribution of neutrinos at 
Super-K at the expence of somwehat lower statistics. The results of MINOS [298] 
and T2K [299] on muon neutrino disappearance are in agreement and, together with 
Super-K and K2K data, give better determination of the oscillation parameters. 


524 Neutrino Oscillations 


The results of experiments briefly reviewed in this Section are also well described 
within the two-neutrino oscillation picture.’ The simplest and most plausible pos- 
sibility is the oscillations of v, to v+. (Otherwise new types of neutrino have to be 
introduced.) In that case, the oscillation parameters are estimated as follows (more 
precise estimates are given in Sec. C.3) 


Arm? in ~ (2 —3)- 1078 eV?, (C.51) 
Om = 45°. (C.52) 


Note that 0gim = 45° corresponds to maximum mixing: the parameter sin? 20atm 
entering (C.11) is equal to 1. Matter effects are of little importance for accelerator 
neutrinos, so in that case we are dealing with vacuum oscillations. 


Problem C.5. Using the parameters of v,-v; system, determine the energy at 
which neutrinos oscillate substantially when crossing the Earth. Do parent muons 
have enough time to decay in the atmosphere? 


Problem C.6. Estimate what part of v, disappears (transforms into vr) in the 
K2K experiment. 


C.2.3. Accelerator and reactor neutrinos: |Ues| 


The parameter Ue3 determines the admixture of electron neutrino Ve in the mass 
eigenstate v3. It gives rise to two effects, showing up in experiments with accelerator 
neutrinos and reactor neutrinos, respectively. The first effect is the appearance of 
electron neutrinos in muon neutrino beam, and the second is the disappearance of 
electron antineutrinos at fairly short distance. The Earth matter effect is not very 
important in both cases, and, furthermore, the inequality (C.18) holds. Thus, we are 
dealing with the first of special cases considered in Section C.1.2. The appearance 
of ve in the muon neutrino beam is approximately described by the formula (C.23a) 
with a = u, B = e, while the electron neutrino survival probability is given by 
(see (C.22)) 
2 
P(H > De) = 1 — 4|Ueg|?(1 — [Uea|*) sin? (1) 

4E 
Clearly, both effects would be absent if Ue3 = 0. 

The first evidence for nonzero Ue3 was obtained at the accelerator neutrino 
experiments T2K [300] and MINOS [301]. They observed the excess of electron 
neutrinos in the beams of mostly muon neutrinos (with small admixture of electron 
neutrinos), although confidence level of this excess was not sufficient to claim the 


? 


TThe two-neutrino oscillation approach, generally speaking, is not applicable to atmospheric neu- 
trinos. In this case, almost complete absence of any distortion of electron neutrino flux is accidental: 
it is due to the interplay between nearly maximum mixing (C.52) and the relation between muon 
and electron neutrino fluxes produced in the atmosphere, y. /®v, ~ 1/2. 
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Fig. C.7. Survival probability of electron antineutrino in Daya Bay experiment. EH1 and EH2 
denote near detectors, and EH3 stands for far detectors. The solid line is the theoretical expectation 
for best fit value of Ue3. The oscillatory behavior is observed due to the capability of measuring 
antineutrino energy. 


discovery. The disappearance of electron antineutrinos was first observed, also at a 
confidence level insufficient for claiming discovery, in a reactor antineutrino experi- 
ment Double CHOOZ in France [302]. The disapearance effect was firmly established 
in reactor antineutrino experiments Daya Bay [303] in China and RENO [304] in 
South Korea. Antineutrinos in Daya Bay experiment are produced by several indus- 
trial reactors and detected both by near detectors (at 500—600 meters from reactors) 
and far detectors at distance 1.6—1.7 km. Antineutrino energy is 2-3 MeV, and for 
mass square difference Am?,,,, it follows from (C.14) and (C.51) that the disappear- 
ance probability is small at near detector sites and is close to maximum at far sites. 
This is precisely what is observed at Daya Bay; see Fig. C.7. 

Similar results are obtained by RENO experiment which employs near and far 
detectors 294 m and 1.4 km away from reactor, respectively. These two experiments 
obtain (precise estimates are given in Sec. C.3) 


|Ue3|2 ~ 0.025. 


Thus, the only unknown parameter of the PMNS marix (C.16) is the CP-phase ô. 
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Figure C.8 (see also Fig. 13.10 on color pages) shows the regions in the parameter 
space® tan? @,.; and Am? allowed by solar neutrino experiments and KamLAND. 
It is seen that all data are consistent with each other in the region (C.45), (C.46). 


8Sometimes the data are parameterized by sin? 26 rather than tan? 6. In the case of vacuum 
oscillations, the former is more natural, see Eq. (C.11). As can be seen from (C.48), sin? 20 is 


not an appropriate parameter in cases where matter effects are significant. Indeed, sin 20 does not 
change under the replacement 0 — (7/2 — 8), while the probability of oscillations in matter does. 
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Fig. C.8. Allowed region of parameter space for oscillations ve +> / obtained from solar neutrino 
experiments and KamLAND [305]. (See Fig. 13.10 for color version.) 


Figure C.9 shows similar data on atmospheric neutrinos, T2K and MINOS. 
Here the allowed region is around (C.51), (C.52). The results of reactor neutrino 
experiment Daya Bay are presented in Fig. C.10. 

We mention here anomalies in several neutrino experiments, which may possibly 
point towards the existence of the third mass squared Am? >> Am?,,,,. These are 
accelerator experiments LSND [308] and MiniBooNE [309], gallium experiments 
with artificial neutrino sources [310] and to some extent reactor neutrino exper- 
iments [311-313]. If these anomalies are confirmed, this would mean that there 
exists the fourth neutrino species, which is light and sterile against the Standard 
Model gauge interactions. For the time being, the results on the anomalies have too 
large statistical and systematic uncertainties, so the situation is quite unclear. 

We limit ourselves with three neutrino flavors and recall that the PMNS matrix 
responsible for oscillations has the form (C.16). The results shown in Figs. C.8 
and C.9 correspond to the following absolute values of the mixing matrix elements 
(30 C.L.) 


0.795-0.846 0.513-0.585 0.126-0.178 
[Uai] = | 0.205-0.543 0.416-0.730 0.579-0.808 
0.215-0.548 0.409-0.725 0.567—0.800 


Current allowed intervals of the mixing angles are (30 C.L.) [314]: 


0.273 < sin? 612 < 0.354, (C.53) 
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Fig. C.9. (Color version on color pages.) Allowed region of parameter space for oscillations 
Vy, > vr [307]. Lines encircle areas allowed by T2K at 68% and 90% C.L. Shaded regions show 
the results (at 90% C.L.) of MINOS and atmospheric neutrino study at Super-K. The results are 
presented for normal and inverse hierarchy of neutrino masses, NH and IH, respectively. Dots show 
the best fit values for the two hierarchies. 
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Fig. C.10. (Color version on color pages.) Allowed regions of parameter space for De > De 
oscillations determined by the Daya Bay reactor neutrino experiment [306]. Dashed lines show 
the results obtained by analyzing solely the antineutrino energy spectrum. Horizontal line Am? 
corresponds to the measurement of the same parameter by MINOS. 
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0.0181 < sin? 013 < 0.0327, (C.54) 
0.341 < sin? 623 < 0.670. (C.55) 


Note that most of the mixing matrix elements are of the same order, Uaj ~ 0.5. This 
“anarchy” in the neutrino mixing matrix distinguishes it from its quark counterpart, 
the CKM matrix (B.29) exhibiting noticeable hierarchy between its elements. 

The results shown in Figs. C.8 and C.9 also determine the differences of masses 
squared. Combined analysis of existing data yields [314] 


7.04- 107-5 eV? < Am2, = Am? < 8.12-1075 eV?, ese 
C.56 
2.27 - 1078 eV? < |Am2,| = Am? < 2.69 - 1073 eV? 


atm 


at 30 C.L. As we discussed above, these values can be obtained with two different 
hierarchies in neutrino masses; see Fig. C.1. 

Which of these two hierarchies exists in Nature is not yet known. At the same 
time, Eq. (C.56) gives lower limits on neutrino masses: at least one of them must be 


M > Matm = \/ Amp, ~ 0.05 eV, (C.57) 


while another is not less than 


Msot = 4/ AM? ~ 0.009 eV. (C.58) 


Minimal possibility is that 
my K Msol, M2 = Msol, m3 = Matm, (C.59) 


(normal hierarchy without degeneracy), but other options are also widely dis- 
cussed, including the case of rather heavy neutrinos almost degenerate in masses, 
mM, M2, M3 D Matm- 


C.4. Dirac and Majorana Masses. Sterile Neutrinos 


There are two different types of fermion masses in 3 + 1 space-time dimensions: 
Majorana and Dirac. The two Lorentz-invariant mass terms in the Lagrangian for 
fermion f are 


m 


y = TM Fef, + he. (C.60) 
Le = -Mp frfa + h.c., (C.61) 


where f° is the charge-conjugate fermion field. 
Recall that 4-component Dirac spinor can be expressed in terms of 2-component 
Weyl spinors Xz, Êr (see Sec. B.1 of Appendix B). In the Weyl basis of the Dirac 
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matrices 


Hence, the mass terms (C.60) read 


Le = -2 xT ioex, + h.c., 


LF = -mpi x, + hic. 


The Dirac mass is only possible if the theory contains both left and right fermion 
components, while the left component (or right component) is sufficient by itself 
for the Majorana mass. Electrically charged fermions can only have Dirac masses, 
otherwise the charge would not be conserved. 

The Standard Model contains only left components of neutrinos, so its mini- 
mal generalization with massive neutrino and without additional fields involves the 
Majorana mass term, 


Mab c 


Dral rg T PGs (C.62) 


M _ 
L S= 


where we used the flavor basis for neutrino fields, and summation over flavor indices 
is assumed. The mass matrix of light neutrinos mag is symmetric, and it can be 
diagonalized by transformation 


m= Os pnag C8 Saad ae 


UPMNS 


where is a unitary matrix and m9 is a diagonal real matrix. UP” is 


precisely the PMNS matrix in the case of Majorana neutrinos. 


Problem C.7. Show that the expression Df „Vra is symmetric in a and B. Hint: 


note that fermion fields anticommute. 


Problem C.8. Show that the flavor basis in related to mass basis by v; = USMS va, 
which is consistent with (C.1). 


Let us come back to the issue of the physical significance of phases in diagonal 
matrices Dı and D2 which enter the general form (C.15) for the PMNS matrix. In 
the first place, one can extract from D2 a factor proportional to the unit matrix; this 
factor commutes with the matrix U and can be absorbed into D1. Therefore, without 
loss of generality one sets Dz = diag(e', e?,1). Now, the neutrino kinetic terms 
are invariant under independent phase rotations of each of the neutrino fields, vry —> 
Ba Via; the interactions with W- and Z-bosons are also invariant under these 
rotations supplemented by the phase rotations of charged fermion fields. Making 
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use of these rotations one can make the matrix D; equal to the unit matrix. Thus, 
the complete PMNS matrix has the form UP’%° = Ọ D», i.e., 


1 0 0 C13 0 sy3e% 
UPMNS— |0 c23 S23 0 1 0 
0 —s23 C23 —sy3e"" 0 cis 

cı s2 O #2 0 0 

x | —si2 &2 0 0 ei2/2 0 

0 0 1 0 0 1 


We emphasize that the last factor (i.e., phases ô1, ô2) is physical for Majorana 
masses only; in the Dirac case one can get rid of this factor in the same way as in 
Sec. B.4. 

Since the Majorana mass term mixes field with its charge-conjugate, the notions 
of particle and antiparticle are not quite adequate for Majorana neutrino. This 
means, in particular, that there is no conserved lepton number: the Majorana mass 
violates lepton number explicitly. Indeed, the expression (C.62) is not invariant 
under phase rotations v — e’*v, 7 > e~'“p. In the case of ultra-relativistic Majo- 
rana neutrino, the eigenstates of the Hamiltonian are states with left and right 
helicities, and up to corrections suppressed by the ratio m/E the left-helicity and 
right-helicity states coincide with neutrino and antineutrino states of massless neu- 
trino theory, respectively. 


Problem C.9. Prove the above statement. To do this, obtain the analog of the Dirac 
equation in the case of the Majorana mass, find its solution in terms of creation 
and annihilation operators and compare it with the solution to the Weyl equation 
for massless left fermions. 


Problem C.10. Show that in the ultra-relativistic case, the off-diagonal Majorana 
mass (C.62) gives rise to oscillations between states of one and the same helicity 
(i.e., left-helicity state oscillates into left-helicity state of different flavor, and not 
into right-helicity state, and vice versa). 


The result of the latter problem implies that oscillations vg + Vg, Da œ Vg, but 
not Va + Vg, are possible in the case of the Majorana mass term. Here neutrino and 
antineutrino are understood as left- and right-helicity states, respectively. With this 
convention, all results of massless theory concerning neutrino interactions remain 
valid: for example, it is antineutrino (right-handed state) that is predominantly 
produced in the neutron decay, while the admixture of left-handed state (neutrino) 
is suppressed by powers of m,/E,. 

Mass term (C.62) cannot be obtained from any SU(3). x SU(2)w x U(1)y- 
invariant renormalizable interaction. Giving up renormalizability, one writes down 
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the interaction of the form 


aß o - 
Lint = 5 < LOH. H’ Le + h.c., (C.63) 
a, bB s 


where we introduced dimensionless couplings °? (indices a, 8 = 1,2,3 label gen- 
erations); A, is the energy scale of a theory that generalizes the Standard Model 
at high energies and leads to non-renormalizable interaction (C.63) at low energies; 
the field H is related to H by (B.10). The Englert-Brout—Higgs field H acquires 
vacuum expectation value (see Appendix B), and the coupling (C.63) gives rise to 
the mass term 


v2 


5A ELDE yg + h.c., 


which coincides with (C.62). Note that the term (C.63) is of the lowest possible 
order in A>+ among all terms capable of giving masses to neutrinos; this is the 
reason we picked it up. 

For ¿°P ~ 1, the scale of new interaction must be of order A, ~ 1015 GeV to 
yield neutrino masses of order 107? eV. This scale is close to the Grand Unification 
scale in supersymmetric theories. 

Non-renormalizable effective interaction (C.63) can arise from renormalizable 
interaction involving new heavy fields, just like the Fermi four-fermion interaction 
emerges upon integrating out massive vector bosons of the Standard Model. The 
smallness of neutrino masses as compared to the masses of other Standard Model 
fermions requires strong hierarchy between the known Yukawa couplings and the 
new couplings, Vey > €°%, and/or between the electroweak scale and the mass 
scale of new heavy fields. In specific models, these hierarchies may have one or 
another natural explanation. 

One possibility here is the so-called see-saw mechanism. To begin with, let us 
consider this mechanism in the case of one type of conventional neutrino v. This 
neutrino is a component of the left lepton doublet L of the Standard Model. Let 
there be another left fermion N,, which is a singlet under the Standard Model 
gauge group SU(3). x SU (2)w x U(1)y. (Equivalently, one introduces right fermion 
N£.) In contrast to the known fields of the Standard Model, the field N, can have 
Majorana mass M unrelated to the vacuum expectation value of the Standard 
Model Englert—Brout—Higgs field. It is remarkable that the gauge invariance of the 
Standard Model allows the Yukawa interaction which couples N, and v to the 
Standard Model Englert-Brout—Higgs field H. So, the renormalizable Lagrangian 
for the fields N, and L includes the terms 


M ~ = 
£L=-—WNEN, —yNCH'L + he, C.64 
2 
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where y is the Yukawa coupling. As a result of electroweak symmetry breaking, the 
field Ht obtains vacuum expectation value (v/V2,0), leading to the mass terms 


M -= 


v2 


Combining left fermions N, and v into the column 


Nev + h.c.. (C.65) 


y= j (C.66) 
we find that the mass term (C.65) can be written as 


1- 
L= seem + h.c., 


where the matrix m is? 
O m 
P (C.67) 
mp M 
and 
yv 
Mp = 


T 
At M > mp, the eigenvalues of the mass matrix (C.67) are (the sign of fermion 
mass is irrelevant) 


2 PREF. 
mi, you 
eee, C.68 
a M 2M (en 
my =M, 


(modulo corrections suppressed by mp/M) where the smaller eigenvalue (C.68) 
corresponds to the eigenvector 


(again up to small corrections). It is seen from (C.66) that the main component 
of this eigenvector is the ordinary neutrino v. Thus, as a result of this mechanism, 
neutrino acquires Majorana mass m,, which is naturally small for M >> mp. This is 
precisely the see-saw mechanism.'° Note that at y = 10~° — 1 (the range of known 


°Tf instead of left field Nz, one uses the right field N¢, then the second term in (C.65) has the 
form of the Dirac mass term in which Nf serves as the right component. Hence the notation mp. 
10The scenario we discuss here [203] is known as type I see-saw. There are two other types. Type 
II see-saw [206] employs a new scalar field which is a triplet under the electroweak SU(2) and 
has nonzero weak hypercharge; small neutrino masses emerge due to the hierarchy between the 
vacuum expectation values of this field and Englert—Brout—Higgs doublet. To construct type HI 
see-saw [315], one introduces fermionic triplets under the electroweak group, and the small neutrino 
masses are obtained in a way similar to type I see-saw. 
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Yukawa couplings of the Standard Model), the mass m, ~ 107? eV is obtained at 
M ~ 10? — 101" GeV, 


i.e., the condition M > mp is satisfied indeed. 

It is worth noting that the result (C.68) can be obtained by integrating out 
the heavy field N,. To this end, let us write the equation obtained by varying the 
action with respect to N,. With the gradient term iN 19” Nz in the Lagrangian 
and mass terms (C.65), we obtain 


yv pe 


V2 


When the momentum and energy are small compared to M, the first term on the 
left-hand side is negligible, and the field N, is algebraically expressed through the 
field v, 


—ið Ney + MNE + =0. 


yv 
V. 
2M 


L” 


We substitute this expression back into the original Lagrangian and thereby obtain 
the effective Lagrangian for the field v. The correction to the kinetic term is negli- 
gible, and the main effect is the mass term 

y2v2 


= IM pov + h.c. 


Lm, 


We see that the Majorana neutrino mass is indeed given by (C.68) up to sign 
omitted there. We also note that starting from the original Lagrangian (C.64) and 
integrating out the heavy field N}, we obtain the effective Lagrangian of the form 
(C.63) with A, = M and £ = y?. 

We now turn to the realistic case of three types of neutrinos. In this case, it 
is natural to introduce three fields Na, a = 1,2,3 (subscript L is omitted) and 
generalize the Lagrangian (C.64) as follows, 


1 7 Ep 
L= no MaN Na = Yag NEH Lg + h.c. 


Here Mag and yag are 3 x 3 matrices (generally speaking, complex) and the matrix 
Mag is symmetric. One can always choose the basis of fields Na in such a way that 
the matrix Mag is real and diagonal, 


M= diag( Mı, Mo, M3). 


In this basis, the field Na describes heavy sterile neutrino of a certain mass. The 
easiest way to find the effective mass term for light neutrinos is to integrate out 
heavy fields Na, as outlined above. As a result, we obtain the Majorana mass term 
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for light neutrinos in the form (C.62) with the mass matrix 
m= -mM tm], (C.69) 


where 


Yapv 


Mpag = : 
Daß v2 


In general, the light neutrino masses and the parameters of the PMNS matrix non- 
trivially depend on the elements of both the diagonal matrix M and matrix of 
Yukawa couplings Yag. 

Let us now discuss the possibility that the known neutrinos have the Dirac 
masses. In this case one adds new light fields VRa, which are the right components 
of neutrinos. Then the Dirac mass term reads 


LP = —mopinatip + h.c., (C.70) 


where the flavor basis is used again. These right components must be neutral (ster- 
ile) with respect to the Standard Model gauge group, otherwise they would give 
contribution, for example, to the total width of Z-boson, whlich is measured with 
high precision and is consistent with the Standard Model prediction. 

Since the Dirac mass is invariant under charge conjugation, the notion of lepton 
number makes sense in this theory: the mass term (C.70), like all other terms of the 
Standard Model Lagrangian, are invariant under the transformation 

q 


Va — e! Va, Var e Da. 
Were the matrix mag diagonal, there would exist conserved lepton numbers for 
each lepton flavor separately. The observed neutrino oscillations show that the 
mass matrix Mag is not diagonal, hence they reveal violation of individual lepton 
numbers. 

The mass terms (C.70) can emerge, for example, due to renormalizable Yukawa 


interaction (cf. (C.64)) 


L=- Y yoploHvyg + hc. (Cm) 
a8 


The Yukawa couplings yag have to be extremely small. In a number of the Stan- 
dard Model extensions (for example, in supersymmetric theories and GUTs), the 
smallness of the Yukawa couplings is obtained naturally due to the presence of an 
intermediate energy scale at which the effective interaction (C.71) emerges. As a 
result, the Yukawa couplings are suppressed by the ratio (or its power) of the inter- 
mediate and the gravitational scales. An illustration is the interaction involving a 
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new scalar S, which is a singlet under the Standard Model gauge group, 


S — 
L=-—- YaploHvrg + h.c., 
Mp 4 3 


where the dimensionless couplings Yag can be of order unity. If the field S' acquires 
nonzero vacuum expectation value v < (S) < Mp, then the effective renormaliz- 
able interaction (C.71) appears at lower energies, with Yukawa couplings of order 
(S ) í Mp <1. 

To conclude this Section, we note that one cannot exclude the possibility that 
there are both Majorana and Dirac neutrino mass terms in the Lagrangian, and 
that both types of masses are important for describing neutrino properties. This 
possibility, however, does not look very natural, since the mechanisms that lead to 
the two different types of mass terms, generally speaking, are different. Hence it 
is hard to expect that these mechanisms lead to mass parameters which are equal 
within an order of magnitude or so. 


C.5. Search for Neutrino Masses 


The present direct experimental limits on neutrino masses are [1]: 


My, <2 eV, (C.72) 
my, < 0.19 MeV, (C.73) 
my, < 18.2 MeV. (C.74) 


These limits are valid regardless of the type of neutrino mass. In the Majorana 
case, the constraint on the combination of neutrino masses relevant for neutrinoless 
double-8 decay of nuclei (for details see, e.g., Ref. [1]) is stronger: 


my <0.3 eV. 


For comparison, the current limit on the sum of neutrino masses, following from 
the measurements of CMB anisotropy and studies of structures in the Universe, is 
at the level 


So my, < 0.2 —0.5 eV, 


depending on which cosmological parameters are fixed from other observations. We 
discuss the cosmological aspects of massive neutrinos in the accompanying book. 

It is expected that the sensitivity of direct laboratory experiments to the mass 
of the electron neutrino will soon reach 0.2—0.02 eV (depending on the type of 
mass). Accuracy of cosmological estimates for the sum of the neutrino masses is 
also improving. 


Appendix D 


Quantum Field Theory 
at Finite Temperature 


In this Appendix we briefly describe the method of calculation of various quantities 
(free energy, effective potential, static Green’s functions) in quantum field theory 
at finite temperature. We consider predominantly the most interesting for cosmol- 
ogy case of zero chemical potentials, although the overall approach allows for an 
appropriate generalization. 

We begin with a general comment. It is sometimes useful to treat quantum 
field theory as quantum mechanics of large but finite number of degrees of free- 
dom. Indeed, field theory can be regularized by introducing spatial lattice of small 
but finite spacing (ultraviolet regularization) and considering the system in a 
3-dimensional box of finite, albeit large size (infrared regularization). For our pur- 
poses, time is conveniently treated as a continuous variable. Then fields (x, t) are 
functions of the lattice sites? and time, ¢(x, t) —> (Xn, t), where Xp are coordinates 
of the lattice site, labeled by a discrete index n = (n1, n2,n3). With this regulariza- 
tion, the number of dynamical coordinates $(Xn,t) is large but finite. In this way 
field theory reduces to quantum mechanics. 

We use this approach to obtain formal results.? Namely, we develop finite tem- 
perature techniques in quantum mechanics, and then merely extend it to quantum 
field theory. 


D.1. Bosonic Fields: Euclidean Time and Periodic Boundary 
Conditions 


Let us consider quantum-mechanical system with dynamical coordinates q = 
(q, q@),...,q). Here we assume that q are bosonic coordinates, as usual in 
quantum mechanics. Let this system be at temperature T. As is known from 


lIn lattice numerical simulations, time is also discretized. This would be inconvenient for us. 
?Gauge fields are naturally considered as living on links of the lattice, rather than on sites. This 
is insignificant for us. 

3We are not going to discuss subtle points concerning removal of ultraviolet and infrared regulariza- 
tions. In brief, no new ultraviolet divergencies or infrared pathologies appear at finite temperature 
as compared to zero temperature theory. 
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statistical physics, in thermal equilibrium the expectation values of operators at 
fixed time are given by 


(D.1) 


where the operator H is the Hamiltonian of the system, the parameter (3 is 


b= 


and trace is taken over all states of the system. Free energy F is determined by 
e OF = Tr(eP#). (D.2) 


Our first goal is to find a convenient representation for the right-hand side of this 
equality. 
Consider a system with one degree of freedom q and the Hamiltonian 


f=- +Vv@. (D.3) 


As a complete set of states in (D.3) we choose eigenstates of the operator @, i.e., we 
work in the coordinate representation. Then 


e PH J dg(qle~8"|q). (D.4) 


Here and in what follows we omit the numerical factor in front of the integral, which 
leads only to overall shift of the free energy, F — F +const. We are interested in the 
expectation values (D.1) in which this pre-factor cancels out, and also in differences 
of free energies of different phases, so the shift cancels out as well. 

Let us obtain a representation for the right-hand side of (D.4) in the form of 
functional integral, i.e., path integral in our case (for details see, e.g., Ref. [271]). 
We write 


(ale |g) = (al J] ei") 
= (q|(1— Arı - #)(1 — Ary Ë) -+ (1 — At - Iq), 


where we divided the interval of length 8 into n small segments of lengths 
Am,..., AT; we take the limit n — oo, Ar; — 0 in the end. We insert the unit 
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operator between each bracket and write 


J aaaea = f T dan Sao = i) (aol = Ara lan) 
k=0 


x (al — Ar: 2) \q2) (an| — AT Han). 


(D.5) 
Now, we make use of the relations 
V6 = / -f2 ip(q-q') 
IVl = Vll — 9) = | 5 Vide 
“2 2 
HP ig = dp P” ip(q—a’) 
wbo = f PR eves’, 
Omitting the numerical coefficient, we have for each factor in (D.5) 
E v( ja 
^ ; a eea ” F 
(ar-1|(1 — Ate : H)lak) = f ipeiazae i al (D.6) 


where we again wrote 


2 -|2 SV tag) Lae 
1- (ven) an oa red, 


The integral over dp, in (D.6) is Gaussian and can be calculated, as usual, by 
shifting pk — pr — idx, where 


This gives 
= a 
laal- Arte qe) = e7 LE TVW) Ar 


Substituting this expression in (D.5) we obtain in the limit n — oo, Ar; — 0 the 
path integral representation for the free energy, 


oor a | Dq eW Se a0), (D.7) 
a(8)=4(0) 
where 
B +2 
gs = | dt p + vo) ; (D.8) 
0 


and ġ = dq/dt. 
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Let us explain the notation introduced here. Sp is the Euclidean action of the 
system with the Hamiltonian (D.1). It is obtained from the original action 


s- fal (2) -vo 


by formal replacement 


and then considering 7 as real. The replacement (D.9) turns S into iS, so that 
er eo, (D.10) 


Further, in accordance with (D.8) the theory is considered in a finite interval of 
Euclidean time 7, whose length equals 8 = T-t. Finally, the functional integral 
(D.7) is taken over paths periodic in T with period £. 

The representation (D.7) of the free energy is intuitively clear. The operator 


e84 can be regarded as an evolution operator e~** in the imaginary (Euclidean) 
time interval tg = —2(. In accordance with this, the matrix element 
(ple? Iai) 


is path integral over trajectories in Euclidean time, which start at q = q; and end at 
q = qf. It is clear from (D.4) that the relevant trajectories are periodic, qi = qf = q, 
without any other conditions imposed on them. 

The above derivation is directly generalized to quantum mechanics of multiple 
degrees of freedom and, in accordance with what is said in the beginning of this 
Appendix, to quantum theory of any bosonic fields. Making use of collective notation 
ġ for all of the bosonic fields, we write the representation for the free energy in a 
form similar to (D.7), 


PF f Dat) eP 400, 


where the integration is performed over field configurations periodict in Euclidean 
time 7 with period Ø, the Euclidean action has the form 


sO — f r / @xLz(o, 6) 


and is obtained from the original action by formally replacing T — —iT, iS —> —Sp, 
as in (D.9), (D.10). In other words, the Euclidean Lagrangian £, in the case of gauge 
theories with scalar fields is obtained from the original Lagrangian by replacing the 


4In the case of non-Abelian gauge theories, the configurations must be periodic up to “large” 
(topologically non-trivial) gauge transformations; see Ref. [156]. This subtlety will be insignificant 
for us. 
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Minkowski metric with the Euclidean metric and by changing the signs in front of 
the scalar potential and in front of the Lagrangian of gauge fields. Schematically, 


1 a a 
lsm aT D,D po + V(¢ġ), (D.11) 
where summation over 4-dimensional indices u,v is performed with the Euclidean 
metric. 


Problem D.1. Show that the outlined procedure for obtaining the Euclidean action 
indeed leads to the expression (D.11), if the original Lagrangian in Minkowski space 
has the form 


b a mia v 
L= nF Fi, + 1 Duo! Dod — VG), 


where Fil, is the gauge field strength, ọ denotes collectively all scalar fields which 
transform according to some (generally speaking, reducible and complex) represen- 
tation of the gauge group. Hint: First, impose the gauge condition Aj = 0, and then 
restore gauge invariance in the Euclidean formulation. 


D.2. Fermionic Fields: Antiperiodic Boundary Conditions 


In the case of fermions, the functional integral representation for the free energy 
has to be derived anew. We restrict ourselves to the case of the action quadratic in 
fermionic fields, although the result will be valid for the general case as well. More 
precisely, we consider theories where fermionic part of the Lagrangian in Minkowski 
space has the form 


L = iby", — PMY, (D.12) 


where M accounts for fermion mass and interaction with bosonic fields (for exam- 
ple, in electrodynamics, M = m — ey""A,,). There can be several fermionic fields; 
generalization to this case is straightforward. Bosonic fields are considered external 
and fixed for the time being. 

Given that 7) = Wiy°, we write the Lagrangian (D.12) in the form 


L= iyoy — H, (D.13) 
where 
H = iyt ah + YPM. (D.14) 


As is seen from (D.13), py = iyt is the generalized momentum conjugate to the 
generalized coordinate w, and H is the Hamiltonian of the theory. 
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In contrast to bosonic fields, fermionic fields obey anticommutation relations; at 
equal times 


{v(x, t), V(x’, t} = {dl t), y (x, t)} = 0, 
{ a(x, t), at (x', t)} = O(x i x’). 
The latter equality is equivalent to the canonical relation {4(x, t), py(x’,t)} = 
i0(x—x’). If spatial lattice and finite spatial box are introduced, we come to quantum 


mechanics of operators obeying (in the Schrödinger representation) anticommuta- 
tion relations 


{abr Yn} z {oh oh} = 0, {bm Ph} = bmn, 
while the discretization of (D.14) leads to the Hamiltonian of the type 


Our purpose is to find the functional integral representation for Tr(e~ 84 ) in this 
theory. 

Let us consider a theory with one fermion operator w and its conjugate wt. They 
satisfy the following relations 


{b, d} = {dt wt} =0, {d,py=1. 


These coincide with the anticommutation relations for the fermion creation and 
annihilation operators. For definiteness, we assume that Ww is the creation operator. 
Then the space of states of the system has two basis vectors |0}, |1), such that 


ýt) =0, lO) = |1), 
Åt) = |0), |1) =0. 


It is convenient to consider this space of states as space of functions V(q) of the 
anticommuting (Grassmannian) variable w, whose fundamental property is nilpo- 
tency, 


Y-Y =0. (D.15) 


Let us associate the vector |0) with the unit function, Y(q) = 1, and the vector |1) 
with the function Yı (Y) = w. Then the linear space with two basis vectors |0) and 
|1} is equivalent to the space of functions of the form 


where a and 8 are complex numbers. In fact, all functions V(w) are of this type. 
This is easy to see by writing the Taylor expansion in w and using (D.15). The 
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operators w and pt act in this space as follows: 
as R o 
btl) =t), PW) = gy 9): 


It is useful to present these formulas in an integral form. We introduce the Berezin 


integral; by definition, 
fw=o, [wyn 


This definition is sufficient for evaluating the integral of any function U(w). It is 
straightforward to check by direct substitution that the following relations hold: 


v) = f ddaile WOH 0 (A), (D.16) 
juw) = | ajaj 0-H iw), (D.17) 
juw) = | dbdbleMO-OGIW(G), (D.18) 

jtv) = | abable HOO Griu) (D.19) 


where all variables and differentials Y, y, pr, dy), dt are treated as anticommuting. 
Now we are ready to write the functional integral for the quantity 


(cP W) (xh), 


where the Hamiltonian is of the form H = chi}. We proceed in analogy to the 
bosonic case and write 


eby = 1- HAn)---(1—- HAt)- ©. 
Using the formulas (D.16) and (D.19) we obtain 
(ec PA W)() 


= J II dppdypte tiO- Hwan E eTA Yn- =n) =H (Pn) Ata D (p), 
k=1 


It is useful to note that, as in the bosonic case, Wk—-1 — Yk = (Th) - At, at small 
ATp. Hence, in the limit n — oo, Ar; — 0 we obtain the functional integral repre- 
sentation, 


(Wy) = J DyDyte SË Vh), (D.20) 
where 


B 
ae J e se + mty) dr. 
0 
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Note that the functional integral in (D.20) includes integration over w; and wl at 
“initial time” + = 0 (with y; = Y(T = 0)), but does not include the integration over 
w and yÝ at “final time” T = 8. Just as in the bosonic case, the Euclidean action 
Sx is obtained from the action in real time 


S= / dt(ipt ðb — H) 


via the formal substitution t > —iT, iS — — Sp. - 
It remains to find the boundary conditions leading to Tr(e~°”). Let us write 


(ec B)(¥) = J daUo bi) (di), (D.21) 
where 
Oe J Dypp S? 


and prime means that the integration is not performed over the initial “value” 
p(T = 0) = yi. (This integration is left for (D.21).) The general expression for the 
function of two Grassmannian variables reads 


U (Y, pi) = uo tury + u_1y; + uppi. 


Then we obtain 
[eve -l= u — u, [uuu Wi) pi = uo — up. 
In the operator language this means 
e° 0} = u_1|0) — wall), e7P#|1) = uol0) — u11). 


Consequently, 
Tr(e PË) = u-u = J iuw 
So, we finally get 
Tr(e-2’) = DyDote-Se, 
%(8)=—(0) 


i.e., the integration is performed over Grassmannian trajectories with antiperiodic 
boundary conditions on ~(7) in the interval (0,3). The variable yt(T) can also 
be considered antiperiodic: in the interval (0,3) any function yÝ (T) is represented 
as a sum of periodic and antiperiodic functions, and the periodic part does not 
contribute to S£ ) since it is convoluted with antiperiodic (r). 

The above derivation is fully applicable to the systems of many fermionic degrees 
of freedom, and, consequently, to the theory of fermionic fields. Here an important 
role is played by the relations (D.17) and (D.18) which we have not used so far. 
Although our derivation is given for the case where bosonic fields are external, it 
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is easy to understand that this is actually not a limitation: in a theory with both 
bosonic and fermionic fields, the integral over fermions can be considered as internal 
(there bosonic fields are fixed), and then the integral over bosonic fields is evaluated. 
Thus, the free energy is given by the integral 


e OF = 7 = J DEDY Dye SE, (D.22) 


where sP is the Euclidean action in the interval (0,3), and bosonic fields @ 
(fermionic fields Y, Yt) satisfy periodic (antiperiodic) boundary conditions there. 

To conclude this Section, we note that the formalism can be generalized to the 
case of nonzero chemical potential. In general, the chemical potential is introduced 
when the medium has nonzero density of conserved (at given temperature) quantum 
number. In the cosmological context, the baryon and lepton numbers are of the 
greatest interest. The corresponding operators have a structure like 


Q= | Exi. 


The nonzero average density n = y?y is taken into account by the additional term 
(—pQ) in the effective Hamiltonian, i.e., 


where u stands for the chemical potential. Within the formalism considered, this 
leads to the following change of the Euclidean action, 


B z 
SP > gs?) — uf dr | dxiyv. (D.24) 
0 


The partition function is then again given by (D.22). In this case, the quantity 
F(T, u) is called the Grand potential or Landau potential. 


D.3. Perturbation Theory 


The approach described in Secs. D.1 and D.2 is useful for calculating the free 
energy, the effective potential Veg(TI,¢) introduced in Chapter 10, as well as the 
static Green’s functions. The latter characterize the response of the system to time- 
independent external probes. For example, suppose that static source J(x) is intro- 
duced into a theory of quantum field ĝ. This implies the following modification of 
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the Hamiltonian 


H >H- | J(x)d(x)d®x = H,, 


where (x) is the Schrédinger field. In the presence of this source, the partition 
function 


Z, =e FFs = Tr(e P45), 


is represented in the form of functional integral (D.22), and its expansion in J has 
the static Green’s functions as coefficients, 


G(x1,...,Xn) = z= f pes” W 
B 


B 
x af dt, 0(X1,71) X +++ x sf dtn(Xn,Tm). (D.25) 


(The normalization by partition function Z without the source and factors 67t are 
introduced for convenience. ); the subscript E in the notation of the Euclidean action 
here and below is omitted. 
A simple example is the field expectation value in the presence of the static 
source, 
_ Tr(e7?¥74(x)) 
(x); = ret) 


To the leading order in J it is equal to (assuming (¢) ;=0 = 0) 


(6(x)),) = I G(x,y)J(y)dy. 


The difference between G(x, y) and the free propagator at zero temperature corre- 
sponds to the modification of the Coulomb or Yukawa law in the presence of the 
medium. 

Note that the static correlation functions (D.25) do not represent all interesting 
classes of Green’s functions. Computational technique for correlators at different 
times (for instance, the Keldysh method) is quite complicated and not required 
here. 

To generalize (D.25) we consider the Euclidean Green’s functions 


G(x1, Tij... Xn, m) = Z= f Des” Molan, n) ...6(XnsTn)s (D.26) 


where ¢ denotes collectively all fields in the theory, and the integration is performed 
over the bosonic and fermionic fields, periodic and antiperiodic in the interval [0, 6], 
respectively. The normalization factor Z is given by a similar integral; see (D.22). 
Proceeding from the representation (D.26), it is straightforward to construct the 
diagram technique for perturbative calculations, similar to the Feynman technique 
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in theories at zero temperature. As usual, we first consider free theories with sources. 
In the scalar and fermion cases, the expressions for the quadratic actions are 


Hie 
sP = [ dr | èx E ð pð p + — 5 —p — Joy], (D.27) 
p = f dr | dx [py 3 t + moby — Jyp — YIy] - (D.28) 
0 
Here z? = 7, summation over indices is performed with the Euclidean metric, and 


the Euclidean y-matrices are Hermitean and obey the rule {y", y” } = 6”. 


Problem D.2. Check that the Euclidean action of the free Dirac field with a source 
has the form (D.28). 


As the field ọ is periodic in 7 with period 8, so is the source J,(x,7). And vice 
versa, Jy, and Jy are antiperiodic. 

The functional integral (D.22) with the quadratic action and source (D.27) is 
Gaussian and is calculated by shifting y(x,T) — y(x,T) + Yc(x,T), where the 
function y-(x,7) is the solution to the equation 


One +My = Jo. (D.29) 


According to our prescription, Ye has to be periodic in 7 with period 8. One writes 


B 
al XT) =| ar! [dx Doxr3x's7!)Jo(X,7"), 
0 


where D is the free propagator at finite temperature. Given the periodicity of Jy, 
it is straightforward to see that Eq. (D.29) and periodic boundary conditions are 
obeyed if the free propagator has the form 


E eiP(*— x')+iwn(T—T') 
D(x, T; x’, T") Tog ki p? + w2 + m? 2 
where 
2 
Lm = n =0,+1,+2,... (D.30) 


are the Matsubara frequencies of bosonic fields. In contrast to the field theory at 
zero temperature, the frequencies form a discrete set. 

The free propagator for the vector field is constructed similarly and is also a 
sum over frequencies (D.30). 
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In the case of fermionic field, the analog of Eq. (D.29) is 
yaupe + Mpe = Jy. 
Both Jy(x,T) and w-(x,7) are antiperiodic in 7 with period 8. The solution is 
B 
We(X,T) = f ar! f dx! S(x,7 ee ale TP). 
where the free propagator is given by 


S(x, T; xT) 


1 iPwn’ IYPEM pix! 45 ae, 
= Bp — E T "iPx Hie (T-T), 
(27)36 2 ; J P- po +m 
n'=+4,+ł4,. 
(D.31) 
Here 
Qrn! Td 

Wn = - ; n’ = z’ >? see (D.32) 


are the Matsubara frequencies for fermions. The fact that n’ runs over half-integer 
values is obviously due to the antiperiodicity of fermionic fields. 

Further development of the diagram technique proceeds along the same lines as 
in the (Euclidean) field theory at zero temperature. The expressions for interaction 
vertices in theories at T = 0 and T ¥ 0 coincide. Since integration over dr in 
the action runs from 0 to 8, instead of the 6-function of energy conservation the 
following factor appears at every vertex, 


Bå Os w) l (D.33) 


where ` w is a sum over the Matsubara frequencies of all lines considered as incom- 
ing (frequencies (D.30) and (D.32) for bosonic and fermionic lines, respectively), and 
the function (X w) equals 1 if $` w = 0 and zero in all other cases. 

Note that turning on chemical potential, according to (D.24), leads to the 
replacement 09 — Oo — y in the action (D.28). The corresponding change in the free 
fermion propagator (D.31) is the replacement 


Wn! > Wy + Ob 


in the pre-exponential factor of the integrand in (D.31), while the Matsubara fre- 


? 


quencies remain intact in the exponential factor expļliwn (T — 7’)]. 


D.4. One-Loop Effective Potential 


As the first example of the use of the technique described in Secs. D.1 and D.2, let 
us re-derive the expression (10.24) for the first temperature-dependent correction 
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Fig. D.1. Some diagrams contributing to the first order correction to the effective potential. 


to the effective potential. Our aim is to calculate the free energy as function of the 
homogeneous background scalar field ¢, neglecting interactions between particles in 
the medium. Our starting point is the formula (D.22) for the free energy. In our 
approximation, the action S{®) is quadratic in quantum fields, and the background 
field @ enters into it only through the particle masses. The integral (D.22) is fac- 
torized into a product of integrals over different fields, so the free energy has the 
structure (10.22) indeed. 

Note that in perturbation theory, the higher-order corrections to the effective 
potential are given by diagrams without external lines, where masses and vertices 
depend on the background field ¢. The simplest of these diagrams are schematically 
shown in Fig. D.1. These diagrams begin with two loops. Therefore, within the 
described formalism, the zeroth order is naturally called one-loop approximation. 

Turning back to the one-loop approximation, consider, for example, the contri- 
bution of the scalar field, whose action is given by formula (D.27) with J, = 0. The 
integral over y of the type (D.22) is Gaussian and equal to 


J Dye S#l = [Det(—A,,0, + M], 


where m? = m?(@), and the determinant can be understood as the product of 
eigenvalues of the operator (—0,,0,, + m?) with boundary conditions of periodicity 
in 7 with period p. If the system is put into a spatial box of large size L, then the 
eigenvalues are 


2 2 2 
Anny ,nasns =p rw, +m, 


where 


2 2 2 
p= ( =, a =] ’ 11, N2, N3 E Z, (D.34) 
and wn are Matsubara frequencies (D.30). Therefore, the contribution to the free 
energy reads 
1 i P? +w? +m? 
=D D goe [PH], 
nm N1,N2,N3 

where the parameter A makes the ratio dimensionless. This parameter leads only 
to an overall shift in the free energy, and hence it is insignificant. In the limit of 
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large L 


n1,nN2,N3 


so that the free energy is indeed proportional to the volume, and the contribution 
to the effective potential is 


1 dp pP? +w? +m? 
Ea apes | a] ma 


This expression is divergent at large p and/or n. We assume that it is regularized 
(see also Sec. 10.4) and proceed to work with it as if it was finite.” 

To calculate the sum over all integer n, we note that it can be represented in 
the form 


>X un) = ~ f cot(rz)u(z)dz, (D.36) 


c N c O 


where integration is performed along closed contour in the complex plane encircling 
the real axis counterclockwise; see Fig. D.2(a). To prove the formula (D.36), it is 


sufficient to note that cotmz has poles at integer z = 0,+1,... with residues equal 
A Imz Amz 
aS 
Rez Rez 
= = 
$$ — 
(a) b) 


Fig. D.2. (a) The contour of integration in Eq. (D.36); (b) Deformed contour. 


5Particularly convenient for this general treatment is Pauli—Villars regularization; see, e.g., the 
book [268]. 
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to m71. In our case 


— (D.37) 


u(z) = log ( 5 


This function has branch cuts at positive and negative semi-axes starting at 


z= sik PETA, 
F; 


so the contour of integration in (D.36) can be deformed as shown in Fig. D.2(b). 
On the positive imaginary semi-axis, the jump across the cut is 


u(ig — e) — u(i + €) = —2ri . 


We recall that cot(inÇ) = —icoth(7¢) and find the contribution of the cut along 
the positive semi-axis: 


_1 f Ëp 1 
28 J (27)? J E Spn 


peonte = 
7) 


dÇ mcoth(x¢) . 
The same contribution comes from the cut along the negative imaginary semi-axis. 
We perform the integration over ¢ and discard the contribution from the upper limit 


of integration and finite terms, which do not depend on parameter m. (Recall that 
we pretend that the integral is regularized.) Thus, 


1 d? . 
f= Ta [sinh (Sverre) | : 


This expression can be represented as 


where 


1 a 
does not depend on temperature, while the temperature-dependent part is 


= 5108 (1 = er) l (D.39) 


(2 


The zero-temperature contribution (D.38) to the effective potential is simply the 
sum of zero-point energies of oscillators of the field y, 


1 1 
fe(T=0)= Fy 5 IVP +m’, (D.40) 
N1,N2,N3 


where for clarity we returned back to the theory in finite spatial volume; the momen- 
tum p is given by (D.34). The effects of this contribution are discussed in Sec. 10.4. 
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The contribution (D.39), relevant at finite temperatures, is, in fact, the free 
energy of ideal gas of bosons [78]. To see explicitly that it is equal to (— pressure), 
we write Eq. (D.39) as the integral over p = |p| and integrate by parts: 


1 p°dp = 2 2 
TF= Bv/p2+m 
i= 5 f 27? log (1-e : ) 


1 °° Darn 
= rb log (1 —e rE 
Op 


~ 6q2 
61? Jo 


SS ptdp 1 
~ Gr? Jo Jp? bm? P 


which is precisely the boson integral (10.24) with g; = 1. (We consider one real 
scalar field vy.) 

The calculation of the fermionic contribution to the one-loop thermal effective 
potential proceeds in a similar way. For the theory with the action (D.28) and 
Jg = Jy = 0, the functional integral (D.22) equals 


(8) 


J DbDype sv = Det [7“0,, + M(H), 


where the eigenfunctions of the Euclidean Dirac operator must be antiperiodic in 
7 with period 8. For fixed 3-momentum p and Matsubara frequency (D.32), there 
are two doubly degenerate eigenvalues of the Dirac operator, 


At = mip? +2. 


As a result, for each momentum we have the factor (A;A_)? in the determinant, 
and instead of (D.35) we obtain 


2 d°p p? +w, +m? 
= —— | —— In | ——————__ |. D.41 
f af F, 33 a| i% | ai 


We emphasize that the difference in sign as compared to (D.35) is due to the fact 
that we are dealing with fermions. When calculating fy, we encounter the sum over 
half-integer n’, which can be written as 


| 2 u(n’) = -5 f tan(rz)u(z)dz, (D.42) 


where the integration contour is the same as shown in Fig. D.2(a), and u(z) is 
still given by formula (D.37). Further calculation basically repeats the calculation 
for the scalar field. The contribution of fermions also has zero-temperature and 
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finite-temperature parts. The former, 


3 
fe(T = 0) = -2 f a VP HM?, (D.43) 


can be interpreted as the contribution of the Dirac sea (negative energy states, 
w = —\/p? + m?, doubly degenerate at each p). The temperature-dependent term 


fs = -5 [Bios (1 + eTe prt?) 


is the free energy of ideal gas of fermions [78] and coincides, upon integration by 
parts, with the fermion integral (10.24), given that the total number of spin states 
of fermion and antifermion is g = 4. 

Thus, in the framework of the formalism presented in this Appendix, the dif- 
ference between the Bose and Fermi statistics manifests itself, in particular, in the 
difference between the Matsubara frequencies (D.30) and (D.32). In Sec. 10.3 we 
discuss the importance of this difference for the infrared properties of the theory at 
high temperatures. 


Problem D.3. In the one-loop approximation, find the Grand potential and fermion 
number density of fermionic matter at chemical potential and temperature T. 
Consider the limiting cases T > u > mandT & u. Hint: Make use of the property 
OF (u, T) 
Ou 
This property follows from (D.23). 


= —(Q) rp 


D.5. Debye Screening 


As the second example, let us consider the one-loop contribution Iy to the photon 
self-energy in quantum electrodynamics at finite temperature and zero chemical 
potential; see Fig. D.3. As usual, it modifies the photon propagator 


Dio (DO eit) a 


where I,» is also called the photon polarization operator. Let us consider the static 
propagator; see (D.25). So, we are interested in the self-energy at zero Matsubara 


T= pu v 


Fig. D.3. One-loop photon self-energy. 
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frequency, 


Map) = Tis (D, Wn = 0). 


The sought-for contribution modifies the static Maxwell equations, which in the 
medium obtain the following form (in momentum space) 


p’ Ao + Too Ao + Moi Ai = jo, 
p’A; — pipA +IlioAo + Win An = ji, 


where j,,(p) are time-independent charge and current densities. 

Before doing the calculation, we note that due to gauge invariance of electrody- 
namics, which holds in the presence of matter as well, the self-energy I,.,(p, wn) 
has to be transverse, 


pill = 0, 


where p = (wn, p). Lorentz-invariance is explicitly broken by the presence of mat- 
ter, but the symmetry with respect to spatial rotations remains intact. Therefore, 
the general structure of the self-energy is 


Hoo = 1), 
—  PiPON(E 
Hio Te p? Tr ); 
2 
_. _ PiPjP0 (E) _ _ PiPj (M) 
a a a + (0) | 


where “electric” and “magnetic” terms, IH) and II), depend on p? and po = wn. 
In the static limit po = wn = 0, only Noo and transverse part of Il;; are nonzero, 
thus the modified Maxwell equations take the following form, 


(p? + I) Ao = jo, (D.44) 


(p? + 1) (5: = tbe \ Aj = jk. (D.45) 


We will be interested in the field behavior at large distances. Hence, in the end we 
take the limit p? — 0. The order of limits is important here: one should first set 
Po = wn = 0, and then take the limit of small p?. 

The interaction of fermions with electromagnetic field is introduced, as usual, 
by the replacement ô, —> ô, — ieA, in the action (D.28), so the diagram in Fig. D.3 
gives 


(i) 


I” (p, p') = —e? / tr "S(x, y)” Sly, a) eX 2 e728" badty, 


where p” = (wn,,p™), pt = (wn, p) are momenta of incoming and outgoing 
photons, integration over 2° and y? is performed in the interval (0, 8), and the 
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+ 
Fig. D.4. The interpretation of self-energy in terms of photon rescattering in medium. 


fermion propagator is given by (D.31). Extracting the 6-function of energy and 
momentum conservation (The former is understood in the sense of (D.33).) and 
setting Wn; = Wn, = 0, p® =p) = p, we obtain 


2 


TP) = om | * De P 


where ĝ = "qu, qo = Se, photon momentum is equal to p” = (0,p), the sum 
is evaluated over halfinteger n’, and momentum squared is understood in the 
Euclidean sense. We again perform summation over the Matsubara frequencies by 
making use of (D.42). Taking the limit of small photon momentum, we arrive at 


oo E —SwhG5 +9? +m?) 


Poles of the integrand are at qo = +i\/q?+m?; they show that in fact we are 
dealing with the photon forward scattering off fermions and antifermions existing 
in the medium; this is schematically illustrated in Fig. D.4, where crosses denote 
particles in the medium. Here we encounter the situation where nominally one-loop 
calculation corresponds to tree-level diagrams for scattering off the medium, while 
the interaction between the particles of the medium is not taken into account. This 
situation is analogous in some sense to that in Sec. D.4. (Formally one-loop calcula- 
tion of the effective potential corresponds to the approximation of non-interacting 
particles in the medium.) 

Integrating over dqo in (D.46) and omitting temperature-independent terms, we 
obtain for the 00-component 


dq utr 


(D.47) 
et + i 


IIopo(p > 0,w = 0) = II eof 2 
where wq = yq? + m?. (When performing this calculation, it is convenient to make 
use of the integration by parts analogous to that used in (D.42).) At the same 
time, the spatial components II,;; are equal to zero in the limit of small photon 
momentum, 


TO) (p > 0,w = 0) =0. 


Recalling (D.44) and (D.45), we see that our results imply that electric field gets 
screened in the medium, while magnetic field does not. (Of course, the latter prop- 
erty has been demonstrated within one-loop approximation only.) Indeed, in the 
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coordinate representation, the solution to Eq. (D.44), in the case of point charge q 
placed at the origin, has the following form at large distances, 


dp etPX q e7 ™plx] 
A p i ———— = 
oc) =a f Qn ptm in N 


where m?, = IŒ) (p — 0,w = 0) is the Debye mass squared. The exponential 
fall-off here precisely means screening of electric field. For the magnetic field this 
phenomenon of exponential fall-off at large distances is absent. 

Note that at T < m, the Debye mass is exponentially small (this is because 
we consider medium at zero chemical potential; the density of fermions is thus 
exponentially small at low temperature), while in the opposite limit 


m= ET, TM, (D.48) 
T 


i.e., the Debye screening radius rp = m}! decreases with temperature. 

At the end of this Section, we mention that the Debye screening arises also 
when the medium contains charged bosons rather than fermions; the contribution 
of bosons to the Debye mass squared is of the same order of magnitude as that of 
fermions of the same mass and electric charge. 

Finally, the Debye screening occurs at nonzero net particle densities (nonzero 
chemical potentials), too. The temperature in that case may be low and even zero, 
the Debye radius is then determined by the density of charged particles. A partic- 
ularly interesting example here is electron—proton medium. 


Problem D.4. Find the Debye radius in an electrically neutral electron-proton 
medium at given electron number density and temperature T in two cases: mp > 
T > Me and Mme > T > A, where A is the binding energy of electron in hydrogen 
atom (A = 13.6eV). Hint: Perform the calculation at fixed chemical potentials of 
electrons and protons; in order to find their relation to the electron number density 
use the results of Problem D.3. 
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